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10 <130> PF 55443 EP 

<160> 16 

<170> Patentln version* 3.1 

<210> 1 

<211> 1160 

<212> DNA 

<213> Rhodosporidium toruloides 

<220> 

<221> CDS 

<222> (1)..(1104) 

<223>~ coding for DAAO 

<400> 1 

atg cac teg cag aag cgc gtc gtt gtc etc gga tea ggc gtt ate ggt 4 8 

Met His Ser Gin Lys Arg Val Val Val Leu Gly Ser Gly Val lie Gly 
15 10 15 

■ 

ctg age age gee etc ate etc get egg aag. ggc tac age gtg cat att 96 
Leu Ser Ser Ala Leu lie" Leu Ala Arg Lys Gly Tyr Ser Val His lie 

20 • 25 30 

35 etc gcg cgc gac ttg ccg gag gac gtc teg age cag act ttc get tea 144 
Leu Ala Arg Asp Leu Pro Glu Asp Val Ser Ser Gin Thr Phe Ala Ser 

40 45 



cca tgg get ggc gcg aat tgg acg cct ttc atg acg ctt aca gac ggt 192 

40 Pro Trp Ala Gly Ala Asn Trp Thr Pro Phe Met Thr Leu Thr Asp Gly 
50 55 60 

cct cga caa gca aaa tgg gaa gaa teg act ttc aag aag tgg gtc gag 24 0 

Pro Arg Gin Ala Lys Trp Glu Glu Ser Thr Phe Lys Lys Trp Val Glu 

45 65 70 75 80 
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ttg gtc ccg acg ggc cat gcc atg tgg etc aag ggg acg agg egg ttc 288 
Leu Val Pro Thr Gly His Ala Met Trp Leu Lys Gly Thr Arg Arg Phe 

85 90 95 



gcg cag aac gaa gac ggc ttg etc ggg cac tgg tac aag gac ate acg 
Ala Gin Asn Glu Asp Gly Leu Leu Gly His Trp Tyr Lys Asp He Thr 

100 105 HO 

■ 

10 cca aat tac cgc ccc etc cca tct tec gaa tgt cca cct ggc get ate 
Pro Asn Tyr Arg Pro Leu Pro Ser Ser Glu Cys Pro Pro Gly Ala He 
115 120 125 

ggc gta acc tac gac acc etc tec gtc cac gca cca aag tac tgc cag 
15 Gly Val Thr Tyr Asp Thr Leu Ser Val His Ala Pro Lys Tyr Cys Gin 
130 135 140 

♦ 

tac ctt gca aga gag ctg cag aag etc ggc gcg acg ttt gag aga egg 
Tyr Leu Ala Arg Glu Leu Gin Lys Leu Gly Ala Thr Phe Glu Arg Arg 
20 145 150 155 160 

acc gtt acg teg ctt gag cag gcg ttc gac ggt gcg gat ttg gtg gtc 
Thr Val Thr Ser Leu Glu Gin Ala Phe Asp * Gly Ala Asp Leu Val Val 

165 170 175 

25 

aac get acg gga ctt ggc gcc aag teg att gcg ggc ate gac gac caa 
Asn 'Ala Thr Gly Leu Gly Ala Lys Ser He Ala Gly He Asp Asp Gin 

180 IBS 190 

30 gcc gcc gag cca ate cgc ggg caa acc gtc etc gtc aag tec cca tgc 
Ala Ala Glu Pro * He Arg Gly Gin Thr Val Leu Val Lys Ser Pro Cys 
195 200 - 205 



336 



384 



432 



480 



528 



57 6 



624 



aag cga tgc acg atg 
35 Lys Arg Cys Thr -Met 
210 

att ccc cga cca ggt 
He Pro Arg Pro Gly 
40 225 

gga gac tgg gac ttg 
Gly Asp Trp Asp Leu 

245 

45 



gac teg tec gac ccc get tct 
Asp Ser Ser Asp Pro Ala Ser 
215 220 

ggc gaa gtc ate tgc ggc ggg 
Gly Glu Val He Cys Gly Gly 
230 235 

tct gtc aac cca gag acg gtc 
Ser Val Asn Pro Glu Thr Val 

250 



ccc gcc tac ate 672 
Pro Ala Tyr He 

acg tac ggc gtg 720 
Thr Tyr Gly Val 

240 

cag egg ate etc 768 
Gin Arg He Leu 
255 
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* 

aag cac tgc ttg cgc etc gac ccg acc ate teg age gac gga acg ate 816 

Lys His Cys Leu Arg Leu Asp Pro Thr lie Ser Ser Asp Gly Thr He 

260 265 270 

5 gaa ggc ate gag gtc etc cgc cac aac gtc ggc ttg cga cct gca cga 8 64 

Glu Gly He Glu Val Leu Arg His Asn Val Gly Leu Arg Pro Ala Arg 
275 280 285 

cga ggc gga ccc cgc gtt gag gca gaa egg ate gtc ctg cct etc gac 912 
10 Arg Gly Gly Pro Arg Val Glu Ala Glu Arg lie Val Leu Pro Leu Asp 
290 295 300 

egg aca aag teg ccc etc teg etc ggc agg ggc age gca cga gcg gcg 960 
Arg Thr Lys Ser Pro Leu Ser Leu Gly Arg Gly Ser Ala Arg Ala Ala • 
15 305 310 315 320 

aag gag aag gag gtc acg ctt gtg cat gcg tat ggc ttc teg agt gcg 1008 

Lys Glu Lys Glu Val Thr Leu Val His Ala Tyr Gly Phe Ser Ser Ala 

325- 330 335 

20 

gga tac cag cag agt tgg ggc gcg gcg -gag gat gtc gcg cag etc gtc 1056 

Gly Tyr Gin Gin Ser Trp Gly Ala Ala Glu Asp Val Ala Gin Leu Val 

340 345 350 

a ^ 

25 gac gag gcg- ttc cag egg tac cac ggc gcg gcg egg gag teg aag ttg 1104 
Asp Glu Ala Phe Gin Arg Tyr His Gly Ala Ala Arg Glu Ser Lys Leu 
355 360 365 



30 



tagggeggga tttgtggctg tattgeggge atctacaaga aaaaaaaaaa aaaaaa 1160 



<210> 2 

<211> 36B 

<212> PRT 

35 <213> Rhodosporidium toruloides 

<400> 2 

Met His Ser Gin Lys Arg Val Val Val Leu Gly Ser Gly Val He Gly 
15 10 15 

40 

Leu Ser Ser Ala Leu He Leu Ala Arg Lys Gly Tyr Ser Val His He 

20 25 30 

Leu Ala Arg Asp Leu Pro Glu Asp Val Ser Ser Gin Thr Phe Ala Ser 
45 35 40 45 
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Pro Trp Ala Gly Ala Asn Trp Thr Pro Phe Met Thr Leu Thr Asp Gly 
50 55 60 

5 Pro Arg Gin Ala Lys Trp Glu Glu Ser Thr Phe Lys Lys Trp Val Glu 
65 70 75 80 

Leu Val Pro Thr Gly His Ala Met Trp Leu Lys Gly Thr Arg Axg Phe 

85 90 95 

10 

Ala Gin Asn Glu Asp Gly Leu Leu Gly His Trp Tyr Lys Asp lie Thr 

' 100 * 105 HO 

Pro Asn Tyr Arg Pro Leu Pro Ser Ser Glu Cys Pro Pro Gly Ala lie 
15 115 120 125 

Gly Val Thr Tyr Asp Thr Leu Ser Val His Ala Pro Lys Tyr Cys Gin 
130 135 140 

20 Tyr Leu Ala Arg Glu Leu Gin Lys Leu Gly Ala Thr Phe Glu Arg Arg 
145 150 155 160 

Thr Val Thr Ser - Leu Glu Gin Ala Phe Asp Gly Ala Asp Leu Val Val 

165 170 175 

25 

Asn Ala Thr Gly Leu Gly Ala Lys Ser lie Ala Gly He Asp Asp Gin 

180 185 190 

Ala Ala Glu Pro He Arg Gly Gin Thr Val Leu Val Lys Ser Pro Cys 
30 195 200 205 

Lys Arg Cys Thr Met Asp Ser Ser Asp Pro Ala Ser Pro Ala Tyr He 
210 215 220 

35 He Pro Arg Pro Gly Gly Glu Val He Cys Gly Gly Thr Tyr Gly Val 
225 230 235 240 

> 

Gly Asp Trp Asp Leu Ser Val Asn Pro Glu Thr Val Gin Arg He Leu 

245 250 255 

40 

Lys His Cys Leu Arg Leu Asp Pro Thr He Ser Ser Asp Gly Thr He 

260 265 270 



Glu Gly He felu Val Leu Arg His Asn Val Gly 
45 275 280 



Leu Arg Pro Ala Arg 
285 
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Arg Gly Gly Pro Arg Val Glu Ala Glu Arg lie Val Leu Pro Leu Asp 
290 295 300 

5 Arg Thr Lys Ser Pro Leu Ser Leu Gly Arg Gly Ser Ala Arg Ala Ala 
305 310 315 320 

Lys .Glu Lys Glu Val Thr Leu Val His Ala Tyr Gly Phe Ser Ser Ala 

325 330 335 



10 



35 



Gly Tyr Gin Gin Ser Trp Gly Ala Ala Glu Asp Val Ala Gin Leu Val 

340 345 350 



Asp Glu Ala Phe Gin. Arg Tyr His Gly Ala Ala Arg Glu Ser Lys Leu 
.15 355 360 365 

<210> 3 

<211> 1005 

<212> DNA 

20 <213> Caenorhabditis elegans 

<220> 

<221> CDS 

<222> (1)..(1002) 

25 <223> coding for DAAO 

<400> 3 

atg gca aac ata att ccg aag att gca att ate ggc gaa gga gtc att 48 

Met Ala Asn He He Pro Lys He Ala He He Gly Glu Gly Val He 
30 1 5 10 15 

gga tgt act tea gca ctt caa.ata.tca aaa get ata cca aat gcg aaa 96 
Gly Cys Thr Ser Ala Leu Gin He Ser Lys Ala He Pro Asn Ala Lys 

20 25 30 



ata act gtg etc cac gat aaa cca ttt aaa aaa teg tgc agt gca gga 144 
He Thr Val Leu His Asp Lys Pro Phe Lys Lys Ser Cys Ser Ala" Gly 
35 40 45 

40 cca gca gga tta ttt aga ate gat tat gag gag aat act gaa tac gga 192 
Pro Ala Gly Leu Phe Arg He Asp Tyr Glu Glu Asn Thr Glu Tyr Gly 
50 55 60 

cgt get tct ttc gec tgg ttc tea cat' etc tat cgc act aca 'aaa gga 240 
45 Arg Ala Ser Phe Ala Trp Phe Ser His Leu Tyr Arg Thr Thr Lys Gly 
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80 



10 



15 



tec gaa acc ggc gtg aaa tta grtt tct gga cat att caa tec gac aac 
Ser .Glu Thr Gly Val Lys Leu Val Ser Gly His He Gin Ser Asp Asn 

85 90 95 

ttg gag tea ttg aag caa caa caa aga gec tat ggc gat att gtg tac 
Leu Glu Ser Leu Lys Gin Gin Gin Arg Ala Tyr Gly Asp He Val Tyr 

100 105 HO 

aac ttt aga ttc ttg gat gat aga gaa egg ctg gac att ttt ccc gaa 
Asn Phe Arg Phe Leu Asp Asp Arg Glu Arg Leu Asp He Phe Pro Glu 
115 120 125 



2BB 



cca tea aag cac tgc att cac tac acc gec tac gca tea gaa ggt 
Pro Ser Lys His Cys He His Tyr Thr Ala Tyr Ala Ser Glu Gly Asn 
130 135 * 140 



336 



384 



432 



aag tac gtg cct tat ttg aag aat ttg ctg ctt gag caa aaa ate gag 
20 Lys Tyr Val Pro Tyr Leu Lys Asn Leu Leu Leu Glu Gin Lys He Glu 
145 150 155 160 



480 



25 



ttc aag caa caa gaa gtg acgagt ttg gac gca gtc gee 
Phe Lys Gin Gin Glu Val Thr Ser Leu Asp Ala Val Ala 

165 170 



get ggt 
Ala Gly 
175 



30 



tac gat gtt att gta aac tgc gca ggc ttg tac ggt gga aag ttg get 

Tyr Asp Val He Val Asn Cys Ala Gly Leu Tyr Gly Gly Lys Leu Ala 

180 185 190 

ggt gat gac gat act tgc tac ccc att aga gga gtc att ttg gaa gtt 

Gly Asp Asp Asp Thr Cys Tyr Pro He Arg Gly Val He Leu. Glu Val 
195 200 205 



528 



576 



624 



35 gat gca cca tgg cac aag cac ttc aat tat cga gac ttt act act ttc 
Asp Ala Pro Trp His Lys 'His Phe Asn Tyr Arg Asp Phe Thr Thr Phe 
210 215 ' 220 



672 



aca att cca aaa gag cac 
40 Thx He Pro Lys Glu His 
225 230 



gtg gtg gtt ggg tec acc aag cag gac 
Val Val Val Gly Ser Thr Lys Gin Asp 

235 240 



720 



45 



cga tgg gat. ttg gag ate acc gac gag gat 
Asn Arg Trp Asp Leu Glu He Thr Asp' Glu Asp 

245 250 



aat gat att ttg 
Asn Asp 'He Leu 
255 



768 
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aaa cga tac att get tta cat cct gga atg aga gag cca aag att ate 816 

Lys Arg Tyr He Ala Leu His Pro Gly Met Arg Glu Pro Lys He He 

260 265 270 

5 

aaa gaa tgg tea gca ctt cgc ccg gga cgt aag cat gtc aga att gaa 864 

Lys Glu Trp Ser Ala Leu Arg Pro Gly Arg Lys His Val Arg He Glu 
275 280 285 

10 gcg cag aag agg aca tct gtt gga aac tea aaa gat tat atg gtt gtg 912 
Ala Gin Lys Arg Thr Ser Val Gly Asn Ser Lys Asp Tyr Met Val Val 
290 295 300 

cat cac tat ggt cac ggg age .aac gga ttc acg ttg ggt tgg gga aca * 960 
15 His His Tyr Gly His Gly Ser Asn Gly Phe Thr Leu Gly Trp Gly Thr 
305 310 315 320 

■ 

gca att gaa gca act aaa ctt gtt aag act gca eta gga tta taa 1005 
Ala lie Glu Ala Thr Lys Leu Val Lys Thr Ala Leu Gly Leu 
20 325 330 * 

■ 

<210> 4 

<211> 334 

<212> PRT / 

<213> Caenorhabditis elegans 

<400> 4 

Met Ala Asn He He Pro Lys He Ala He He Gly Glu Gly Val He . 
1 5 .10 15 

Gly Cys Thr Ser Ala Leu Gin lie Ser Lys Ala He Pro Asn Ala Lys 

20 25' • 30 

■ 

He tfhr Val Leu His Asp Lys Pro Phe Lys Lys "Ser Cys Ser Ala Gly 
35 40 45 

Pro Ala Gly Leu Phe Arg He Asp Tyr Glu Glu Asn Thr Glu Tyr Gly 
50 55 60 

40 Arg Ala Ser Phe Ala Trp Phe Ser His Leu Tyr Arg Thr Thr Lys Gly 
65 70 75 80 



30 



45 



Ser Glu. Thr 



Gly Val Lys Leu Val 
65 



Ser Gly His He Gin Ser Asp Asn 
9(3 95* 
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Leu Glu Ser Leu Lys Gin Gin Gin Arg 

100 105 



Tyr Gly Asp He Val Tyr 

110 



Asn Phe Arg Phe Leu Asp Asp Arg Glu Arg Leu Asp He Phe Pro Glu 
5 115 120 125 

Pro Ser Lys His Cys He His Tyr Thr Ala Tyr Ala Ser Glu Gly Asn 
130 135 140 

10 Lys Tyr Val Pro Tyr Leu Lys Asn Leu Leu Leu Glu Gin Lys He Glu 
145' 150 155 160 

Phe Lys Gin Gin Glu Val Thr Ser Leu Asp Ala Val Ala Asp Ala Gly 

165 170 175 



15 



Tyr Asp Val He Val Asn Cys Ala Gly Leu Tyr Gly Gly Lys Leu Ala 

180 185 190 



Gly Asp Asp Asp Thr Cys Tyr Pro He Arg Gly Val He Leu Glu Val 
20 195 200 205 

Asp Ala Pro Trp His Lys His Phe Asn Tyr Arg Asp Phe Thr Thr Phe 
210 215 220 

25 Thr He Pro Lys Glu His Ser Val Val Val Gly Ser Thr Lys Gin Asp 
225 • 230 235 240 

* 

Asn Arg Trp Asp Leu Glu He. Thr Asp Glu Asp Arg Asn Asp He Leu 

245 250 255 



- 30 



Lys Arg Tyr He Ala Leu His Pro Gly Met Arg Glu Pro Lys He He 

260 265 • 270 



Lys Glu Trp Ser Ala 
35 275 

Ala Gin Lys Arg Thr 
290 



Arg Pro Gly 
280 

Val Gly Asn 
295 



Lys His Val Arg He Glu 
285 

Lys Asp Tyr Met Val Val 
300 



40 His His Tyr Gly His Gly Ser Asn Gly Phe Thr .Leu Gly Trp Gly Thr 
305 310 315 320 



Ala He Glu Ala Thr Lys. Leu Val Lys Thr Ala Leu Gly Leu 

. 325 330* 

45 
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10 



15 



35 



<210> 5 

<211> 118 6 
<212> 

<213> Necrtria haematococca 
<220> 

<221> CDS 

<222> (42) . . (1124) 

<223> coding for DAAO 



40 cag agg tct ggt ttc cca gac gcc etc ttc teg aaa gaa ccc tgg ttc 

Gin Arg Ser Gly Phe Pro Asp Ala Leu Phe Ser Lys Glu Pro Trp Phe 

105 110 115 

aag aac atg ttt gag gac ttc cgt gag cag cac cct age gag gtc ate 

45 Lys Asn Met Phe Glu Asp Phe Arg Glu Gin His Pro Ser Glu Val lie 



56 



104 



<400> 5 

agegacttga atttagegaa aagaacttgt" caaccacaat c atg tec aac aca ate 

Met Ser Asn Thr lie 
1 5 

gtc gtc gtt ggt gcc ggt gtc att ggc ttg acg teg gcc ttg ttg etc 
Val Val Val Gly Ala Gly Val He Gly Leu Thr Ser Ala Leu Leu Leu 

10 15 20 

20 tec aag aac aag ggc aac aag ate ace gtc gtg gcc aag cac atg ccc 152 
Ser Lys Asn Lys Gly Asn Lys He Thr Val Val Ala Lys His Met Pro 

25 • 30 .35 

200 



ggc gac tat gac gtt gaa tac gcc teg cct ttt get ggt gcc aac 
25 Gly Asp Tyr Asp Val Glu Tyr Ala Ser Pro Phe Ala Gly Ala Asn His 

40 45 50 

tec ccc atg gcg acg gaa gag age age gaa tgg gaa cgt cgc act tgg 248 
Ser Pro Met Ala Thr Glu Glu Ser Ser Glu Trp Glu Arg Arg Thr Trp 
30 55 60 65 

tac gag ttt aag aga ctg gtc gag gag gtc cct gag gcc ggt gtt cat 296 
Tyr Glu Pile Lys Arg Leu Val Glu Glu Val Pro Glu Ala Gly Val His 
70 75 80 85 



ttc cag aag tct cgc ate cag agg cgc aat gtg gac act gaa aag gcg 34 4 

Phe Gin Lys Ser Arg He Gin Arg Arg Asn Val Asp Thr Glu Lys Ala 

90 95 100 



392 



440 
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120 



125 



130 



ccc ggt tac gac tct ggc tgc gag ttc aca teg gtg tgc ate aac acg 
Pro Gly Tyr Asp Ser Gly Cys Glu Phe Thr Ser Val Cys He Asn Thr 
135 140 145 

gee ate tac etc ccc tgg etc etc ggc cag tgc ate aag aat ggc gtc 
Ala He Tyr Leu ?ro Trp Leu Leu Gly Gin Cys He Lys Asn Gly Val 
150 155 160 165 

ate gtc aag cgc gec ate etc aac gac att age gag gec aag aag ctg 
He Val Lys Arg Ala He Leu Asn Asp He Ser Glu Ala Lys Lys Leu 

170 "5 1B0 



488 



536 



584 



age cac gcg ggc aag acg ccc 
Ser His :Ala Gly Lys Thr Pro 

185 



ate ate gtc aac gee acg ggt etc 
He He Val Asn Ala Thr Gly Leu 
190 • 



632 



ggc tec tac aag ctg ggc- ggt gtc gag gac aag acc atg gcg cct gcg 
Gly Ser Tyr Lys Leu Gly Gly Val Glu Asp Lys Thr Met Ala Pro Ala 
200 205 210 

egg gga cag att gtg gtt gtg cgc aac gag age age. ccc atg etc etc 
Arg Gly Gin He Val Val Val Arg Asn Glu Ser Ser Pro Met Leu Leu 
215 220* ' 225 

act tea ggt gtc gag gac ggc ggt get gat gtc atg tac ttg atg cag 
Thr Ser Gly Val Glu Asp Gly Gly Ala Asp Val Met Tyr Leu Met Gin 
230 235 _ 240 245 

gca get ggc ggt ggo acc ate ctg ggc ggt acc tac gac gtt ggc 
Ala Gly Gly Gly Thr He Leu- Gly Gly Thr Tyr Asp Val Gly 
250 • 255 260 



Arg Ala 



aac tgg gag tct cag cca gac ccc aac ate gcg aat ego ate atg cag 
Asn Trp Glu Ser Gin Pro Asp Pro Asn He Ala Asn Arg He Met Gin 

265 270 275 

cge ate gtc gag gtg egg ccc gag att gee aac ggc. aag ggc gtc aag 
Arg He Val Glu Val Arg Pro Glu He Ala Asn Gly Lys Gly Val Lys 
280 285 290 

ggg ctg age gtg ate cga cac gee gtc ggc atg egg ccg tgg cga aag 
Gly Leu Ser Val He Arg His Ala Val Gly Met Arg Pro Trp Arg Lys 
295 300 305 



680 



728 



776 



824 



872 



920 



968 
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gac gga gtc agg ate gag gag gag aag ctg gat gat gag act tgg ate 1016 

Asp Gly Val Arg lie Glu Glu Glu Lys Leu Asp Asp Glu Thr Trp lie 
310 315 320 325 

5 

gtg cac aac tac gga cac tct gga tgg ggt tac cag ggt teg tat ggt 1064 

Val His Asn Tyr Gly His Ser Gly Trp Gly Tyr Gin Gly Ser Tyr Gly 

330 335 340 

10 tgt get gag aat gta gtc cag ttg gtt gac aag gtc ggc aag gcg gee 1112 
Cys Ala Glu Asn Val Val Gin Leu Val Asp Lys Val Gly Lys Ala Ala 

345 350 355 

aag tct aag ctg tagttgaaaa ggcctgaatg agtaatagta attggatatt 1164 
15 Lys Ser Lys Leu 

360 

ggaaataccg tatttgeect eg 1186 

20 <210> 6 

<211> 361 

<212> PRT 

<213> Nectria haematococca 

25 <400> 6 

Met Ser Asn Thr lie Val Val Val -Gly Ala Gly Val lie Gly Leu Thr 
1 5 10 15 

Ser Ala Leu Leu Leu Ser Lys Asn Lys Gly Asn Lys He Thr Val Val 
30 20 25 30 

Ala Lys His Met Pro Gly Asp Tyr Asp Val .Glu. Tyr Ala Ser Pro Phe 
35 40 45 

35 Ala Gly Ala Asn His Ser Pro Met Ala Thr Glu Glu Ser Ser Glu Trp 
50 55 60 



40 



Glu Arg Arg Thr Trp Tyr Glu Phe Lys Arg Leu Val Glu Glu Val Pro 

65 70 75 80 

Glu Ala Gly Val His Phe Gin Lys Ser Arg He Gin Arg Arg Asn Val 

85 90 95 



Asp Thr Glu Lys Ala Gin Arg" Ser Gly Phe Pro Asp 'Ala Leu Phe Ser 
45 100 105 110 
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Lys Glu Pro Trp Phe Lys Asn Met Phe Glu Asp Phe Arg Glu Gin His 
115 120 125 

5 Pro" Ser Glu Val lie Pro Gly Tyr Asp Ser Gly Cys Glu Phe Thr Ser 
130 135 140 

Val Cys lie Asn Thr Ala He Tyr Leu Pro Trp Leu Leu Gly Gin Cys 
145 150 155 160 

10 

He Lys Asn Gly Val He Val Lys Arg Ala He Leu Asn Asp He Ser 

165 170 175 

Glu Ala Lys Lys Leu Ser His Ala Gly Lys Thr Pro Asn He He Val 
15 180 185 190 

Asn Ala Thr Gly Leu Gly Ser Tyr Lys Leu Gly Gly Val Glu Asp Lys 
195 200 205 

20 Thr Met Ala Pro Ala Arg Gly Gin He Val Val Val Arg Asn Glu Ser 
210 215 220 



Ser Pro Met Leu Leu Thr Ser Gly Val Glu Asp Gly Gly Ala Asp Val 
• 225 230 235 240 

25 

Met Tyr Leu Met Gin Arg Ala Ala Gly Gly Gly Thr He Leu Gly Gly 

245 250 255 



Thr Tyr Asp Val Gly Asn Trp 
30 2£0 

Asn Arg He Met Gin Arg He 
275 

35 

Gly Lys Gly Val Lys Gly Leu 
290 295 

Arg Pro Trp Arg Lys Asp Gly 
40 305 310 

Asp Glu Thr Trp He Val His 

325 



Glu Ser Gin Pro Asp Pro Asn He Ala 
265 270 

Val Glu Val Arg -Pro Glu He Ala Asn 
280 285 



Ser Val He Arg His Ala Val Gly Met 

300 

Val Arg He Glu Glu Glu Lys Leu Asp 

315 320 

Asn Tyr Gly His Ser Gly Trp Gly Tyr 
330 335 



45 
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Glu Asn Val Val Gin Leu Val Asp Lys 
345 350 



Val Gly Lys Ala Ala Lys Ser Lys Leu 
5 355* 360 



<210> 7 

<211> 1071 

<212> DNA 

10 <213> Trigonopsis variabilis 

<220> 

<221> CDS 

<222>* (1) . • (1068) 
15 <223> * 



<400> 7 • 

atg get aaa ate gtt gtt att ggt gec' ggt gtt gec ggt tta act aca 

Met Ala Lys He Val Val lie Gly Ala Gly Val Ala Gly Leu Thr Thr 

20 1 5 10 15 

get ctt caa ctt ctt cgt aaa gga cat gag gtt aca att gtg tec gag 

Ala Leu Gin Leu Leu Arg Lys .Gly His Glu Val Thr* He Val Ser Glu 

20 25 ■ 30 

25 

ttt acg ccc ggt gat ctt agt ate gga tat acc teg cct tgg gca ggt 

Phe Thr Pro Gly Asp Leu Ser He Gly Tyr Thr Ser Pro Trp Ala Gly 

35 40 45 

30 gec aac tgg etc aca ttt tac gat gga ggc aag tta gec gac tac gat 

Ala Asn Trp Leu Thr Phe Tyr Asp Gly Gly Lys Leu Ala Asp Tyr Asp 

50 55 60 • 



40 



gec gtc tct tat cct . ate ttg cga gag ctg get cga age age ccc 
35 Ala Val Ser Tyr Pro He Leu Arg Glu Leu 'Ala Arg Ser Ser Pro Glu 
65 '70 75 80 



get gga att cga etc ate age caa cgc tec cat gtt etc aag cgt gat 
Ala Gly He Arg Leu He Ser Gin Arg Ser His Val Leu Lys Arg Asp 



ctt cct aaa ctg gaa gtt gec atg teg gee ate tgt caa cgc aat 
Leu Pro Lys Leu Glu Val Ala Met Ser Aia He Cys Gin Arg Asn 

'100 105 HO 



48 



96 



144 



192 



240 



288 



336 



45 



15 



20 



40 
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tgg ttc aaa aac aca gtc gat tct ttc gag att ate gag gac agg tec 

Trp Phe Lys Asn Thr Val. Asp Ser Phe Glu lie He Glu Asp Arg Ser 
115 120 125 

5 agg att gtc cac gat gat gtg get tat eta gtc gaa ttt cgt tec gtt 

Arg He Val His Asp Asp Val Ala Tyr Leu Val Glu Phe Arg Ser Val 

130 135 140 

tgt ate cae acc gga gtc tac ttg aac tgg ctg atg tec caa tgc tta 

10 Cys He His Thr Gly Val Tyr Leu Asn Trp Leu Met Ser Gin Cys Leu 
145 150 155 160 



teg etc ggc gee acg gtg gtt aaa cgt cga gtg aac cat ate aag gat 
Ser Leu Gly Ala Thr Val Val Lys Arg Arg Val Asn His He Lys Asp 

165 . 170 ■ 175 

■ 

gee aat tta eta cac tec tea gga tea cgc ecc gac gtg att gtc aac 

Ala' Asn Leu Leu His Ser Ser Gly Ser Arg Pro Asp Val He Val Asn 

1B0 185 ' 190 

■ 

* • 

tgt agt ggt etc ttt gee egg ttc ttg gga ggc gtc gag gac aag aag 
Cys Ser Gly Leu Phe Ala Arg Phe Leu Gly Gly Val Glu Asp Lys Lys 
195 " • 200 -205 

25 atg tac cct att cga gga caa gtc gtc ctt gtt cga aac tct ctt ect 
Met Tyr Pro He Arg Gly Gin Val Val Leu Val Arg Asn Ser Leu Pro 
210 215.- 220 

ttt atg gee tec ttt tec age act cct gaa aaa gaa aat gaa gac gaa 
30 Phe Met Ala Ser Phe Ser Ser Thr Pro Glu Lys Glu 'Asn Glu Asp Glu 
225 230- • ■ 235 240 

get eta tat ate atg acc cga ttc gat ggt act tct ate att ggc ggt 
Ala Leu Tyr lie Met Thr Arg Phe Asp Gly. Thr Ser He lie Gly Gly 
35 245 250 255 

tgt ttc caa ccc aac aac tgg tea tec gaa ccc gat cct tct etc acc 
Cys Phe Gin Pro Asn Asn Trp Ser Ser Glu Pro Asp Pro Ser Leu Thr 

260 ' 265 270 



cat cga ate ctg tct aga gec etc gac cga ttc ccg gaa ctg acc aaa 
His Arg lie Leu Ser Arg Ala Leu Asp Arg Phe Pro Glu Leu Thr Lys 
275 280 285 



384 



432 



480 



528 



576 



624 



672 



720 



7 68 



816 



8 64 



45 
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gat ggc cct ctt gac att gtg cgc gaa tgc gtt ggc cac cgt cct ggt 912 
Asp Gly Pro Leu Asp lie Val Axg Glu Cys Val Gly His Arg Pro Gly 
290 295 300 

5 aga gag ggc ggt ccc cga gta gaa tta gag aag ate ccc ggc gtt ggc 960 
Arg Glu Gly Gly Pro Arg Val Glu Leu Glu Lys lie Pro Gly Val Gly 
305 310 315 320 

ttt gtt gtc cat aac tat ggt gec gec ggt get ggt tac caa tec tct 1008 
10 Phe Val Val His Asn Tyr Gly Ala Ala Gly Ala Gly Tyr Gin Ser Ser 

325 330 335 



tac ggc atg get gat gaa get gtt tct tac gtc gaa aga get ctt act 
Tyr Gly Met Ala Asp Glu Ala Val Ser Tyr Val Glu Arg Ala Leu Thr 
15 340 345 350 

cgt cca aac ctt tag 
Arg Pro Asn Leu • 
355 

20 

<210> 8 
<211> 356 
<212> PRT 
25 <213> Trigonopsis variabilis 

■ 

<400> 8 

Met Ala Lys lie Val Val He Gly Ala Gly Val Ala Gly Leu Thr Thr 
1 5 10 15 



30 



Ala Leu Gin Leu Leu Arg Lys Gly His Glu Val Thr lie Val Ser Glu 

20 25 30 



Phe Thr Pro Gly Asp Leu Ser He Gly. Tyr Thr Ser Pro Trp Ala Gly 
35 35 40 45 

Ala Asn Trp Leu Thr Phe Tyr Asp Gly Gly Lys Leu Ala Asp Tyr Asp 
50 55 60 

40 Ala Val Ser Tyr Pro He Leu Arg Glu Leu Ala Arg Ser Ser Pro Glu 
65 70 75 B0 

Ala Gly He Arg Leu He Sex Gin Arg Ser His Val Leu Lys Arg Asp 

'85 50 95 

45 



1056 



1071 
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Leu Pro Lys Leu Glu Val Ala Met Ser Ala lie Cys Gin Arg Asn Pro 

100 105 HO 

Tip Phe Lys Asn Thr Val Asp Sex Phe Glu lie He Glu Asp Arg Ser 
5 115 120 125 

Arg lie Val His Asp Asp Val Ala Tyr Leu Val Glu Phe Arg Ser Val 
130 135 140 

10 Cys lie His Thr Gly Val Tyr Leu Asn Trp Leu Met Ser Gin Cys Leu 
145 150 155" 160 

Ser Leu Gly Ala Thr Val Val Lys Arg Arg Val Asn His lie Lys Asp 

165 170 175 . 

15 

Ala Asn Leu Leu His Ser Ser Gly Ser Arg Pro Asp Val lie Val Asn 

180 1B5 190 

Cys Ser Gly Leu Phe Ala Arg Phe Leu Gly Gly Val Glu Asp Lys Lys 
20 195 200 205 

Met Tyr Pro lie Arg Gly Gin Val Val Leu Val Arg Asn Ser Leu Pro 
210 215 220 

25 Phe Met Ala Ser Phe Ser Ser Thr Pro Glu Lys Glu Asn Glu Asp Glu 
225 230 235 240 

Ala Leu Tyr He Met Thr Arg Phe Asp Gly Thr Ser He He Gly Gly 

245 * 250 255 

30 

Cys Phe Gin Pro Asn Asn Trp Ser Ser Glu Pro Asp Pro. Ser Leu Thr 

260 ' 265 270 

His Arg He Leu Ser Arg Ala Leu Asp Arg Phe Pro. Glu Leu Thr Lys . 
35 275 280 2B5 

Asp Gly Pro Leu Asp He Val Arg Glu Cys Val Gly His Arg Pro Gly 
290 295 300 

■ 

40 Arg Glu Gly Gly Pro Arg Val Glu Leu Glu Lys He Pro Gly Val Gly • 
305 310 315 320 



45 



Phe Val Val His Asn Tyr Gly Ala Ala Gly Ala Gly Tyr Gin Ser Ser 

325 330 335' 
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Tyr Gly Met Ala Asp Glu Ala Val Ser Tyr Val Glu Arg Ala Leu Thr 

340 345 350 

m 

« 

Arg Pro Asn Leu 
5 355 



10 



<210> 9 

<211> 1047 

<212> DNA 

<213> Schizosaccharamyces pombe 



<220> 

<221> CDS 

15 <222> (22).. (1041) 

<223> coding for DAAO 



20 



<400> 9 

atgactaagg aaaataagcc a aga gat att gtc ate "gtt ggc get ggc gtt 

Arg Asp He Val lie Val Gly Ala Gly Val 
- 1 5 10 



51 



25 



att gga ttg acc act get tgg att ctt tea 
lie Gly Leu Thr Thr- Ala Trp He Leu Ser 

15 20 



ttg ggt ctt get cct 
Leu Gly Leu Ala Pro 

25 



99 



30 



cgt att aag gtg att gcc aag tat acg cct gaa gat cgt tct gta gaa 
Arg He Lys Val lie Ala Lys Tyr Thr Prq Glu Asp Arg Ser Val Glu 

30 35 40 

tac act tec cct tgg get ggc gca aat ttc fcgt age att tct get act 
Tyr Thr Ser Pro Trp Ala Gly Ala Asn Phe Cys Ser He Ser Ala Thr 
45 50 



147 



195 



35 gat* gac aat get ttg cgc tgg gat aaa ate act tac cat cgt ttc gcc 
Asp Asp Asn Ala Leu Arg Trp Asp Lys 'He Thr Tyr His Arg Phe Ala 
60 65 70 



243 



tac ttg gcg aaa act cgt cct gaa gca gga ate cgt ttt get gat ctt 
40 Tyr Leu Ala Lys Thr Arg Pro Glu Ala Gly He Arg Phe Ala Asp Leu 
75 80 85 90 



291 



cga gaa ttg tgg gag tac gag ccg aaa cac gac aaa ate aga tec tgg 
* Arg Glu Leu Trp Glu "Tyr Glu Pro Lys His 'Asp Lys He Arg Ser trp 
45 95 100 105 



339 
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20 



40 



aat acc tat gtc aga gat ttc aaa gtt ate cct gaa aaa gat ctt cca 387 
Asn Thr Tyr Val Arg Asp Phe Lys Val lie Pro Glu. Lys Asp Leu Pro 

110 115 120 



cac cct gaa gac act ttg gat ate ctt aag aga aca teg get ttg atg 
His Pro Glu Asp Thr Leu Asp lie Leu Lys Arg Thx Ser Ala Leu Met 

255 260 265 

cca gaa ttg ttc cac ggc aag ggt ccg gag ggt get gaa att att caa 
Pro Glu Leu Phe His Gly Lys Gly Pro Glu Gly Ala Glu* He He Gin 

270 275. 280 



5 gga gaa tgt ate tac gga cat aag gec acc acc ttt tta ate aac get 435 
Gly Glu Cys He Tyr Gly His Lys Ala Thr Thr Phe Leu He Asri Ala 
125 130 135 

cct cat tac ttg aat tat atg tac aag ctg etc att gaa get ggc gtc 483 
10 Pro His Tyr Leu Asn Tyr Met Tyr Lys Leu Leu He Glu Ala Gly Val 
140 145 150 

gaa ttt gaa aag aaa gaa ttg agt cac ate aaa gag act gtc gaa gaa 531 
Glu Phe Glu Lys Lys Glu Leu Ser His He Lys Glu. Thr Val Glu Glu 
15 155 160 165 170 

act cca gaa get tea gta gta ttt aat tgc act ggt etc tgg get tec 579 
Thr Pro Glu Ala Ser Val Val Phe Asn Cys Thr Gly Leu Trp Ala Ser 

175 180 185 

aaa ttg ggt ggc gtt gaa gac ccg gac gtt tat ccg act cgt gga cat 
Lys Leu . Gly Gly Val Glu Asp' Pro Asp Val Tyr Pro Thr Arg § Gly His 

190 195 ■ 200 

25 gtt gtt ttg gtt aag get cct cat gta aca gaa act cgc att ttg aat 
Val Val Leu Val Lys Ala Pro His Val Thr Glu Thr; Arg He Leu Asn 
205 210 215 

ggc aag aac tct gat acc tat att att cct cgt ccc tta aat ggt gga 
30 Gly Lys Asn Ser Asp Thr Tyr He He Pro Arg Pro Leu Asn Gly Gly 
220 225 230 

gtc att tgc ggc ggt ttc atg caa cca gga aac tgg gat cgt gaa att 771 
Val He Cys Gly Gly Phe Met Gin Pro Gly Asn Trp Asp Arg Glu He • 
35 235 240 245 250 



627 



675 



723 



B19 



867 



45 
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gaa tgt gtc gga ttc cgt cct tct cga aag ggt ggt gcc cgc gta gag 915 
Glu Cys Val Gly Phe Arg Pro Ser Arg Lys Gly Gly Ala Arg Val Glu 
285 290 295 

5 ctt gat gtt gtt ccc ggc acc tea gtc ccc ctt gtt cat gat tac ggt 963 
Leu Asp Val Val Pro Gly Thr Ser Val Pro Leu Val His Asp Tyr Gly 
300 305 310 

get tct ggc aca gga tac caa get ggt tat ggt atg get ctt gac -tct 1011 
10 Ala Ser Gly Thr Gly Tyr Gin Ala Gly Tyr Gly Met Ala Leu Asp Ser 
315 320 325 330 

gtc atg ttg get ctt cct aaa ate aaa ttg gcttag 1047 
Val Met Leu Ala Leu Pro Lys lie Lys* Leu 
15 335 340 



<210> 10 

<211> 340 

20 <212> PRT 

<213> Schizosacc&aromyces pombe 

<400> 10 

Arg Asp lie Val lie Val Gly Ala Gly Val lie Gly Leu Thr Thr Ala 
25 1 5 10 15- 

Trp lie Leu Ser Asp Leu Gly Leu Ala Pro Arg lie Lys Val lie Ala 

.20 25 30 

30 Lys Tyr Thr Pro Glu Asp Arg Ser Val Glu Tyr Thr Ser- Pro Trp Ala 

40 45 



35 



Gly Ala Asn Phe Cys Ser lie Ser Ala Thr Asp Asp Asn Ala -Leu Arg 
50 55 60 

Trp Asp Lys lie Thr Tyr His Arg Phe Ala Tyr Leu Ala Lys Thr Arg 
65 70 75 80 



Pro Glu Ala Gly lie Arg Phe Ala Asp Leu Arg Glu Leu Trp Glu Tyr 
40 85 90 95 

Glu Pro Lys His Asp Lys lie Arg Ser Trp Asn Thr Tyr Val Arg Asp 

100 105 110 



45 
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Phe Lys Val lie Pro Glu Lys Asp Leu Pro Gly Glu Cys lie Tyr Gly 
115 120 125 

His Lys Ala Thr Thar Phe Leu lie Asn Ala Pro His Tyr Leu Asn Tyr 
.5 130 135 140 

Met Tyr Lys Leu Leu lie Glu Ala Gly Val .Glu Phe Glu Lys Lys Glu 
145 150 155 160 

10 Leu Ser His He Lys Glu Thr Val Glu Glu Thr Pro Glu Ala Ser Val 

165 170 175 

Val Phe Asn Cys Thr Gly Leu Trp Ala Ser Lys Leu Gly Gly Val Glu 

180 185 190 

15 

Asp Pro Asp Val Tyr Pro Thr Arg Gly His Val Val Leu Val Lys Ala 
195 200 205 

Pro His Val Thr Glu Thr Arg He Leu Asn Gly Lys Asn Ser Asp Thr 
20 210 215 220 

Tyr lie lie Pro Arg Pro Leu Asn Gly Gly Val lie Cys Gly Gly Phe 
225 ' 230 235 240 

25 Met Gin Pro Gly Asn Trp Asp Arg Glu He His Pro Glu Asp Thr Leu 

245 250 255 

Asp He Leu Lys. Arg Thr Ser Ala Leu Met Pro Glu Leu Phe His Gly 

260 265 270 

30 

Lys Gly Pro Glu- Gly Ala Glu He He Gin Glu Cys Val Gly Phe Arg 
275 2B0 2B5 



35 Pro Ser Arg Lys Gly Gly Ala Arg Val Glu Leu Asp Val Val Pro Gly 
290 295 300 

Thr Ser Val Pro Leu Val His Asp Tyr Gly Ala Ser Gly Thr Gly Tyr 
305 310 315 320 

40 

Gin Ala Gly Tyr Gly Met Ala Leu Asp Ser Val Met Leu Ala Leu Pro 

325 330 • 335 



45 



Lys lie' Lys Leu 

340 
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<210> 11 

<211> 963 

<212> DNA 

<213> Stxeptontyces coelicolor 



10 



15 



20 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



CDS 

(31) ,.(957) 
coding for DAAO 



m±sc_ 
(B80) 
DAAO 



. (936) 



<400> 11 

gtggaaaccg aactggatga cgagcgggat ggc gaa gtc gtc gtg gtc ggc ggc 

Gly Glu Val Val Val Val Gly Gly 
1 5 

ggg gtg ate ggg ctg acg acg gec gtc gtc etc gec gag egg ggc aga 
Gly Val lie Gly Leu Thx Thx Ala Val Val Leu. Ala Glu Arg Gly Arg 
10 15 20 . 



10. 



25 egg gtg egg ctg tgg. ace egg gag ccc gcg gag egg acc ace teg gtg 
Arg Val Arg Leu Trp Thr Arg Glu Pro Ala Glu Arg Thx Thx Ser Val 
25 30 ' 35 40 

gta gcg ggc ggg ctg tgg tgg ccg tac egg ate gag ccg gtc gcg ctg 
30 Val Ala Gly Gly Leu Trp Trp Pro Tyr Arg lie Glu Pro Val Ala Leu 

45 50 55 



15 O 



19 8 



35 



gee cag gec tgg gcg ctg cgt tec ctg gac. gtg tac gag gag ctg gcg 
Ala Gin Ala Trp Ala Leu Arg Ser Leu Asp Val Tyr Glu Glu Leu Ala 

60 65 70 



24 6 



40 



gca egg ccc ggg cag acc ggc gta cgc atg etc gaa ggg gtg etc ggc 
Ala Arg Pro Gly Gin Thr Gly Val Arg Met Leu Glu Gly Val Leu Gly 
75 80 B5 

gag acc ggc ctg gac gag gtg gac ggg tgg gec gcg gec egg ctg ccg 
Glu Thr Gly Leu Asp Glu Val Asp Gly Trp Ala Ala Ala Arg Leu Pro 
90 95 100 



29 4 



34 2 



45 ggg ctg cgc gcg gcg age gee gee gag tac gec ggg acg ggg ctg tgg 



3^0 
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Gly Leu Arg Ala Ala Ser Ala Ala 
105 HO 

gcg egg ctg ccg etc ate gac atg 
5 Ala Axg Leu Pro Leu lie Asp Met 

125 

gag egg ctg ctg gee gcg ggc ggc 
Glu Arg Leu Leu Ala Ala Gly Gly 
10 140 



Glu Tyr Ala Gly Thr Gly Leu Trp 
115 • 120 

teg acc cat ctg ccg tgg ctg egg 438 
Ser Thr His. Leu Pro Trp Leu Arg 
130 135 

acg gtg gag gac cgc gcg gtg acc 486 
Thr Val Glu Asp Arg Ala Val Thr 
145 150 



gat ctg gec gag gcg 
Asp Leu Ala Glu Ala 
155 

15 

ggc gec egg gag ctg 
Gly Ala Arg Glu Leu 
170 

20 cag ctg gtc.gtc gtg 
Gin Leu Val Val* Val 
185 

gec gac gcg gac tec 
25 Ala Asp Ala Asp Ser 

205 

egg etc ctg ctg ggc 
Arg Leu Leu Leu Gly 
30 220 

ccg gac ccg gag gtc 
Pro Asp Pro Glu Val 
235 

35 

cgt ccc gag ate gee 
Arg Pro Glu lie Ala 
250 



gac gcg ccg gtg gtg gtc aac 
Asp Ala Pro Val Val Val Asn 
160 

gtg ccg gac ccg gcg gta egg 
Val Pro Asp Pro Ala Val Arg 
175 180 

■ 

gag aac ccc ggc ate cac aac 
Glu- Asn Pro Gly lie. His Asn 
19JQ 195 

« 

ggg gag acg acg tac ttc ctt 
Gly Glu Thr Thr Tyr Phe Leu 

210 

ggc acg get gag gag gac gec 
Gly Thr Ala Glu Glu Asp Ala 

225 

gcg gcg gee ate gtg cga egg 
Ala Ala Ala lie Val Arg Arg 
240 

gga gcg egg gtg etc gcg cac 
Gly Ala Arg Val Leu Ala His 
255 260 



tgc acc ggc ctg 534 

Cys Thr Gly Leu 

165 

* 

ccg gtg cgc gga 5B2 
Pro Val Arg Gly 

tgg ctg gtc gcg 630 
Trp Leu Val Ala 

• 200 

ccg cag ccg gga 678 
Pro Gin Pro Gly 
215 

tgg teg acc gag 72 6 

Trp Ser Thr Glu 

230 . " 

tgc gcg gee ctg 77 4 

Cys Ala Ala Leu 

245 

ctg gtg ggg ctg 822 
Leu Val Gly Leu 



40 egg ccg gec egg gac 
Arg Pro Ala Arg Asp 
265 

ggg cgc egg ctg gtg 
45 Gly Arg Arg Leu Val 



gcg gtc egg ctg gag cgc 
Ala Val Arg Leu Glu Arg 
270 275 

cac aac tac ggt cac ggc 
His Asn Tyr Gly His Gly 



ggg acg ctg ccg gac 87 0 

Gly Thr Leu Pro Asp 

280 

ggc gcg ggc gtc acc 918 
Gly Ala Gly Val Thr 
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285 290 295 

gtg gcc tgg ggc tgc get cag gag gcg gec egg etc gec tcctga 963 
Val Ala Trp Gly Cys Ala Gin Glu Ala Ala Arg Leu Ala 
5 300 305 



<210> 12 

<211> 309 

10 <212> PRT 

<213> Streptomyces coelicolor 

<220> 

<221> misc__feature 

15 <222> (880) -.(936) 

<223> DAAO signature 

<400> 12 

Gly Glu Val Val Val Val Gly Gly Gly Val lie Gly Leu Thr Thr Ala 
20 1 5 • 10 15 

Val Val Leu Ala Glu Arg Gly Arg Arg Val Arg Leu* Trp Thr Arg Glu 

20 25 30 

25 Pro Ala Glu Arg Thr Thr Ser Val Val Ala Gly- Gly Leu Trp Trp Pro 

35 40 45 

Tyr Arg lie Glu Pro Val Ala Leu Ala Gin Ala Trp Ala Leu Arg Ser 
50 55 60 



30 



Leu Asp Val Tyr Glu Glu Leu Ala Ala Arg Pro Gly Gin Thr Gly Val 
65 70 75 80 



Arg Met Leu Glu Gly Val Leu Gly Glu Thr Gly Leu Asp Glu Val Asp 
35 85 90 95 

Gly Trp Ala m Ala Ala Arg Leu Pro Gly Leu Arg Ala Ala Ser Ala Ala 

100 105 110 

40 Glu Tyr Ala Gly Thr Gly Leu Trp Ala Arg Leu Pro Leu He Asp Met 

115 120 ' 125 

Ser Thr His Leu Pro Trp Leu Arg Glu Arg Leu Leu Ala Ala Gly Gly 
130 135 ' 140 

45 
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Thr Val Glu Asp Arg Ala Val Thr Asp Leu Ala Glu Ala Asp Ala Pro 
145 150 * 155 160 

Val Val Val Asn Cys Thr Gly Leu Gly Ala Arg Glu Leu Val Pro Asp 
5 .165 170 175 

Pro Ala Val Arg Pro Val Arg Gly Gin Leu Val Val Val Glu Asn Pro 

180 1B5 190 

10 Gly lie His Asn Trp Leu Val Ala Ala Asp Ala Asp Sezr Gly Glu Thr 

195 200 205 

Thr Tyr Phe Leu Pro Gin Pro Gly Arg Leu Leu Leu Gly- Gly Thr Ala 
210 215 220 

15 " 

Glu Glu Asp Ala Trp Ser Thr Glu Pro Asp Pro Glu Val- Ala Ala Ala 
225 230 235 240 

lie Val Arg Arg Cys Ala Ala Leu Arg Pro Glu He Ala Gly Ala Arg 
20 245 250 255 

Val Leu Ala His Leu Val.. Gly Leu Arg Pro Ala Arg As]=> Ala Val Arg 

260 .265 " 270 

25 Leu Glu Arg Gly Thr. Leu Pro Asp Gly Arg Arg Leu Val His Asn Tyr 

275 280 285 

a- 

Gly His Gly Gly Ala Gly Val Thr Val Ala Trp Gly Cys Ala Gin Glu 
290 - 295 300 

30 

Ala Ala Arg Leu Ala 
305 

<210> 13 
35 <211> 1038 
<212> DNA 

<213> Candida boidinii 

* 

<220> 

40 <221> CDS 

<222> (1)..(1035) 

<223> coding for DAAO 



45 
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<400> 13 

atg ggt gat caa att gtt .gtt ctt ggt tec ggt att att ggt tta tat 
Met Gly Asp Gin lie Val Val Leu Gly Ser Gly lie lie* Gly Leu Tyr 
15 10 15 



48 



act aca tac tgt tta ate tat gag get gga tgt get cca get aaa att 
Thr Thr Tyr Cys Leu He Tyr Glu Ala Gly Cys Ala Pro Ala Lys He 

20 25 30 



96 



10 act att gtt get gaa ttt tta cca ggt gat caa tct aca tta tat aca 
Thr He Val Ala Glu Phe Leu Pro Gly Asp Gin Ser Thr Leu Tyr Thr 
35 40 45 



144 



tct cca tgg gca ggt ggt aat ttt tct tgt att tea cca get gat gat 
15 Ser Pro Trp Ala Gly Gly Asn Phe Ser Cys He Ser Pro Ala Asp Asp 
50 55 60 



192 



aca aca ttg get tat gat aaa ttc aca tat ctt aat tta ttc aag att 
Thr Thr Leu Ala Tyr Asp Lys Phe Thr Tyr Leu Asn Leu Phe Lys He 
20 65 70 • 75 80 



240 



25 



cac aaa aaa tta ggt gga cca gaa tgt gga tta gat aat aag cca agt 
His Lys Lys Leu Gly Gly Pro Glu Cys Gly Leu Asp Asn Lys Pro Ser 

*85 90 95 

• ■ 

act gaa tat tgg gat ttt tat cct ggt gat gaa aaa gtc aat tct tta 
Thr Glu Tyr Trp Asp Phe Tyr Pro Gly Asp Glu Lys "Val Asn Ser Leu 

100 105 110 



288 



336 



30 aaa caa tat ctt aaa gat ttt "aaa gtt ■ att cca aaa tea gaa tta cca 
Lys Gin Tyr Leu Lys Asp Phe Lys Val He Pro Lys Ser Glu Leu Pro 
115 120 125 



384 



gaa ggt gtt gaa tat ggt att agt tat act aca tgg aat ttc 
35 Glu Gly Val Glu Tyr Gly He Ser Tyr Thr Thr Trp Asn Phe 
130 135 140 



tgt 
Cys 



432 



cct gtt ttc tta caa aat atg get aat ttt tta aat aaa aga aat gtt 
Pro Val- Phe Leu Gin Asn Met Ala Asn Phe Leu Asn Lys Arg Asn Val 
40 145 150 155 160 



480 



45 



ace att att 
Thr He He 



aaa cat 
Lys His 
165 ' 



aca cat att tct caa get tat tta aca 
Thr His He Ser Gin Ala Tyr Leu Thr 
170 * 175 * 



528 
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gtt aat aca aaa gtt gtt ttc aac tgt - aca ggt att ggt get get gat 
Val Asn Thr Lys Val Val Phe Asn Cys Thr Gly lie Gly Ala Ala Asp 

180 185 190 

5 tta ggt ggt gtt aaa gat gaa aaa gtt tat cca act aga gga caa gtt 
Leu Gly Gly Val Lys Asp Glu Lys Val Tyr Pro Thr Arg Gly Gin Val 
195 200 205 

gtt gtt gtt aga get cca cat att caa gaa aat aaa atg aga tgg ggt 
10 Val Val Val Arg Ala Pro His He Gin Glu Asn Lys Met Arg Trp Gly 
210 215 220 

* 

aaa gac tat get act tat att att cca aga cca tat tct aat ggt gaa 

Lys Asp Tyr Ala Thr Tyr He lie Pro Arg Pro Tyr Ser Asn Gly Glu 

15 225 230 235 240 

tta gtc tta ggt ggt ttc tta caa aag gat aat tgg aca ggt aat act 

Leu Val Leu Gly Gly Phe Leu Gin Lys Asp Asn Trp Thr Gly Asn Thr 

245 • -250 ' 255 

20 

ttt ggt ttt gaa act gat gat att gtt agt aga act aca tct tta tta 

Phe Gly Phe Glu Thr Asp Asp.. lie- Val. Ser Arg - -Thx Thr Ser Leu Leu 

260 265 270 

25 cca aag att tta gat gaa cca ctt cat att att aga gtt gca get ggt 
Pro Lys He Leu Asp Glu Pro Leu His lie He Arg Val Ala Ala Gly 
275 280 285 

tta aga cca agt aga cat ggt ggt cca aga att gaa get gaa gtt tgt 
30 Leu *Arg Pro Ser Arg His Gly Gly Pro Arg He Glu Ala Glu Val Cys 
290 295 300 

« 

gaa gaa ggt aaa tta act att cat aat tat ggt get tct gga tat ggt 

Glu Glu Gly Lys Leu Thr He His Asn Tyr Gly Ala Ser Gly Tyr Gly 

35 305 310 315 320 

tat caa get ggt tat ggt atg tct tat gaa get gtc aaa ctt tta gtt 

Tyr Gin Ala Gly Tyr Gly Met Ser Tyr Glu Ala Val Lys Leu Leu Val 

325 330 335 

40 

gat aac caa aaa gtt aaa get aaa ctt tag 

Asp Asn Gin Lys Val Lys Ala Lys Leu 

340 345 



57 6 



624 



672 



720 



768 



816 



8 64 



912 



960 



1008 



1038 



45 
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<210> 14 
<211> 345 
<212> PRT 

<213> Candida boidinii 

5 

<400> 14 

Met Gly Asp Gin lie Val Val Leu Gly Ser Gly lie lie Gly Leu Tyr 
15 10 15 

10 Thr Thr Tyr Cys Leu lie Tyr Glu Ala Gly Cys Ala Pro Ala Lys lie 

20 25 30 

Thr He Val Ala Glu Phe Leu Pro Gly Asp Gin Ser Thr Leu Tyr Thr 
35 40 45 

15 

Ser Pro Trp Ala Gly Gly Asn Phe Ser Cys He Ser Pro Ala Asp Asp 
50 55 60 

Thr Thr Leu Ala Tyr Asp Lys Phe Thr Tyr Leu Asn Leu Phe Lys lie' 
20 65 70 "75 80 

* 

His Lys Lys Leu Gly Gly Pro Glu Cys Gly Leu Asp. Asn Lys . Pro Ser 

85 . 90 95 

m 

25 Thr Glu Tyr Trp Asp Phe Tyr Pro Gly Asp Glu Lys Val Asn Ser Leu 

100 105 HO 

Lys Gin Tyr Leu Lys Asp Phe Lys Val He Pro Lys Ser Glu Leu Pro 
115 120 125 

30 

Glu Gly Val Glu Tyr Gly He Ser Tyr Thr Thr Trp Asn Phe Asn Cys 
130 135 140 

* ¥ 

Pro Val Phe Leu Gin Asn Met Ala Asn Phe Leu Asn Lys Arg Asn Val - 
35 145 150 155 160 

Thr He He Arg Lys His Leu Thr His He Ser Gin Ala Tyr Leu Thr 

165 1*70 175 

40 Val Asn Thr Lys Val Val Phe Asn Cys Thr Gly He Gly Ala Ala Asp 

180 185 190 



Leu Gly Gly Val Lys Asp Glu Lys Val Tyr Pro Thr Arg Gly Gin Val 
195 200 205 

45 
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Val Val Val Arg Ala Pro His lie Gin Glu Asn Lys Met Arg Trp Gly 

210 215 220 

Lys Asp Tyr Ala Thr Tyr lie lie Pro Arg Pro Tyr Ser Asn Gly <31u 
5 225 230 235 240 



Leu Val Leu Gly Gly Phe Leu Gin Lys Asp Asn Trp Thr Gly Asn Thr 

245 250 255 

10 

Phe Gly Phe Glu Thr Asp Asp He Val Ser Arg Thr Thr Ser Leu Leu 

260 265 270 

Pro Lys He Leu Asp Glu Pro Leu His He He Arg Val Ala Ala Gly 
15 275 2B0 285 

Leu Arg Pro Ser Arg His Gly Gly Pro Arg He Glu Ala Glu Val Cys 
290 295 300 

20 Glu Glu Gly Lys Leu Thr He His Asn Tyr Gly Ala Ser Gly Tyr Gly 
305 310 315 320 

Tyr Gin Ala Gly Tyr Gly Met Ser Tyr Glu Ala Val Lys Leu Leu Val 

325 330 335 

25 

Asp Asn Gin Lys Val Lys Ala Lys Leu 

340 345 



30 <210> 15 

. <211> 12466 

<212> DNA 

<213> vector daaoScelTetON 



35 



<220> 
<221> 
<222> 
<223> 



misc. 
(38) 



. (183) 



Agrobacterium right border 



40 



<220> 
<221> 
<222> 
<223> 



mi s cofeature 
(445).. (462) 
recognition / 



cleavage site for I-Scel endonuclease 
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<220> 
<221> 
<222> 
<223> 



(196) (400) 

complementary: 35 S terminator 



10 



15 



20 



<220> 

* 

<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

■ 

<220> 
<221> 
<222> 
<223> 



mi s cofeature 
(515) (1222) 

■ 

complementary: coding for I-Scel endonuclease 



promoter 
(1270) (1660) 

complementary: coding for pTOPlOP teracyclin regulatable promoter 



terminator 
(1735) (1990) 

complementary: NOS terminator 



25 



misc_ 
(2067) 

co: 
se 



. (3173) 



plementary: co 



for Rhodotorula gracilis D- amino acid oxida 



30 



<220> 
<221> 
<222> 
<223> 



(3217) .. (5028) 

complementary: Arabidopsis thai i ana nitrilase I promoter 



35 



<220> 
<221> 
<222> 
<223> 



terminator 
(5118) (5343) 

complementary: OCS terminator 



40 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(5418) (6425) 

complementary: coding for tetracyclin repressor rtTA 



<220> 

<221> promoter 

<222> (6*479) (7341) 

45 <223> complementary : coding for Pisum sativum ptxA promoter 
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<223> complementary: coding for 23S terminator (functioning as homology 



<220> 

<221> terminator 
<222> (7345)'. . (7549) 



<221> mis cofeature 
<222> (7618) . - (7834) 
10 <223> Agrobacterium left border 

<400> 15 

aatattcaaa caaacacata cagcgcgact tatcatggac atacaaatgg acgaacggat 60 

aaaccttttc acgccctttt aaatatccga ttattctaat aaacgctctt ttctcttagg 120 

15 tttacccgcc aatatatcct gtcaaacact gatagtttaa actgaaggcg ggaaacgaca 180 

atcagatctg gtacccggtc actggatttt ggttttagga attagaaatt ttattgatag 240 

aagtatttta caaatacaaa tacatactaa gggtttctta tatgctcaac acatgagcga 300 

aaccctataa gaaccctaat tcccttatct gggaactact cacacattat tctggagaaa 360 

aatagagaga gatagatttg tagagagaga ctggtgattt ttgcgccggg taccccaaac 420 

20 tgtctcacga cgttttgaac ccagattacc ctgttatccc tagtcgagcg gccgccagtg 48 0 

tgatggatat ctgcagaatt cgccctttta gatcttattt caggaaagtt tcggaggaga 540 

tagtgttcgg cagtttgtac atcatctgcg ggatcaggta cggtttgatc aggttgtaga 600 

agatcaggta agacatagaa tcgatgtaga tgatcggttt gtttttgttg atttttacgt 660 

aacagttcag ttggaatttg ttacgcagac ccttaaccag gtattctact tcttcgaaag 720 

25 tgaaagactg ggtgttcagt acgatcgatt tgttggtaga gtttttgttg taatcccatt 780 

taccaccatc atccatgaac cagtatgcca gagacatcgg ggtcaggtag ttttcaacca 840 

ggttgttcgg.gatggttttt ttgttgttaa cgatgaacag gctagccagt ttgttgaaag 900 

cttggtgttt gaaagtctgg gcgccccagg tgattaccag gttacccagg tggttaacac 960 

gttctttttt gtgcggcggg gacagtaccc actgatcgta cagcagacat acgtggtcca 1020 

30 tgtatgcttt gtttttccac tcgaactgca tacagtaggt tttaccttca tcacgagaac 1080 

ggatgtaagc atcacccagg atcagaccga tacctgcttc gaactgttcg atgttcagtt 1140 

* cgatcagctg ggatttgtat tctttcagca gtttagagtt cggacccagg ttcattacct 1200 

ggtttttttt gatgtttttc atatggtcga ctaaagggcg aattccagca cactggcggc 1260 

cgttactagc ccgggctcga gcaaatgtct agaaaggcct tatatacgta aagggtcttg 1320 

35 cgaagactag atcactctat ctcgagttta ccactcccta tcagtgatag agaaaagtga 1380 

aagtcgagtt taccactccc tatcagtgat agagaaaagt gaaagtcgag tttaccactc 144 0 

cctatcagtg atagagaaaa gtgaaagtcg agtttaccac tccctatccg tgatagagaa 1500 

aagtgaaagt cgagtttacc actccctatc agtgatagag aaaagtgaaa gtcgagttta 1560 

ccactcccta tcagtgatag agaaaatgaa agtcgagttt accactccct atcagtgata 1620 

40 gagaaaagtg aaagtcgagc tcggtaccga gctcgaattc agcacactgg cggccgttac 1680 

tagtggatca attcactggc cgtcgtttta caacgactca gagcttgaca ggaggcccga 174 0 

tctagtaaca tagatgacac cgcgcgcgat aatttatcct agtttgcgcg ctatattttg 1800 

ttttctatcg cgtattaaat gtataattgc gggactctaa tcataaaaac ccatctcata 1860 

aataacgtca tgcattacat gttaattatt* acatgcttaa cgtaattcaa cagaaattat 1920 

45 atgataatca tcgcaagacc ggcaacagga ttcaatctta agaaacttta ttgccaaatg 1980 
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cacaactgaa atcaaagcca gattcataat 
aacacaacat ccagaattga acactcaaat 
agagaattgg ttctcttcca ttattaacga 
ttgaaactca acataacatt ttgaacttga 
5 atcacaaccg attacttgtt tgagtcatct 
accggtttct tcgggtgctc atttacggtc 

m 

ggtagggata acagggtaat ctgggttcaa 
aattcgagct cggtaccaat tcccatcttg 
aaataacaag tcaggtatta tagtccaagc 

10 attcagaaat atttcaataa ctgattatat 
gacftgcgata ttatgtgtaa tacataaatt 
atcttgcgcc gggtaccgag ctcggtagca 
ccaggagagt tgttgatcta cccaccgtac 
tgctcaaact cgaagtcggc catatccaga 

15 aatcccggac ccggggaatc cccgtccccc 
tcggcatgcg ccatcgccac gtcctcgccg 
tcggtcgggg gggccgtcga cagtctgcgc 
g^agccgcca gccccgcctc ttcgggggcg 
atggtagacc cgtaattgtt tttcgtacgc 

20 ttaagttgtt tttctaatcc gcatatgatc 
gcaccttggt gatcaaataa ttcgatagct 
gtaggtgttt ccctttcttc tttagcgact 
aaagtaaaat gccccacagc gctgagtgca 
tggcataaaa aggctaattg attttcgaga 

25 tctgaatgta cttttgctcc attgcgatga 
ttattgcgta aaaaatcttg ccagctttcc 
ctatctaaca tctcaatggc taaggcgtcg 
tacagtgtag gctgctctac accaagcttc 
attccgacct cattaagcag ctctaatgcg 

•30 gacatggtcg atcgactcta gactagtgga 
ttcgaagatt ttagtgtaat gtgtgrtgctc 
agaatgagtg atgaggttta tatggtgaaa 
tctttctcgt gagtcatatt cacggaccat 
tttttatgtt aaatcattga ttgattttta 

35 atgaattcaa acaaactagg ccagaaaaaa 
aatt-tgttta tttttttcat ggataagggg 
gacatttttt taacatgtag tgacaagtag 
aatcttaatc tttttttcat aaatctgcac 
gtgctaaact attggtaatt tctttttgta 

40 gaaatgtgtc atcctaaaaa caacattttg 
aatggaagaa gcaatatgtt gatgtttatt 
taattgacaa ctatgatttt atattggatt 
ggtttcaaag catgttatgc tagtgattct 
tccttgaatg gagggacaaa ctttgaatga 

45 ctcagcttca caaaaaattg cggacggtca 
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tattgataga agtattttac aaatacaaat acatactaag ggtttcttat atgctcaaca 7440 

catgagcgaa accctataag aaccctaatt cccttatctg ggaactactc acacattatt 7500 

ctggagaaaa atagagagag atagatttgt agagagagac tggtgatttt ccgggggatc 7560 

ctctagagtc gaggtaccga gctcgaattc actggccgtc gttttacaac gactcagtac 7620 

5 tgcttggtaa taattgtcat tagattgttt ttatgcatag atgcactcga aatcagccaa 7680 

ttttagacaa gtatcaaacg gatgttaatt cagtacatta aagacgtccg caatgrtgtta 7740 

ttaagttgtc taagcgtcaa tttgtttaca ccacaatata tcctgccacc agccagccaa 7800 

cagctccccg accggcagct cggcacaaaa tcaccacgcg tctaaaaagg tgatgtgtat 7860 

ttgagtaaaa cagcttgcgt catgcggtcg ctgcgtatat gatgcgatga gtaaataaac 7920 

10 aaatacgcaa ggggaacgca tgaaggttat cgctgtactt aaccagaaag gcgggtcagg 7980 

caagacgacc atcgcaaccc atctagcccg cgccctgcaa ctcgccgggg ccgatgttct 8040 

gttagtcgat tccgatcccc agggcagtgc ccgcgattgg gcggccgtgc gggaagatca 8100 

accgctaacc gttgtcggca tcgaccgccc gacgattgac cgcgacgtga aggccatcgg 8160 

ccggcgcgac ttcgtagtga tcgacggagc gccccaggcg gcggacttgg ctgtgtccgc 8220 

15 gatcaaggca gccgacttcg. tgctgattcc ggtgcagcca agcccttacg acatatgggc. 8280 

caccgccgac ctggtggagc tggttaagca gcgcattgag gtcacggatg gaaggctaca 8340 

agcggccttt gtcgtgtcgc gggcgatcaa aggcacgcgc atcggcggtg aggttgccga 8400 

ggcgctggcc gggtacgagc tgcccattct tgagtcccgt atcacgcagc gcgtgagcta 84 60 

' cccaggcact gccgccgccg gcacaaccgt tcttgaatca * gaacccgagg gcgacgctgc 8520 

20 ccgcgaggtc caggcgctgg ccgctgaaat taaatcaaaa ctcatttgag ttaatgaggt 8580 

aaagagaaaa tgagcaaaag cacaaacacg ctaagtgccg gccgtccgag cgcacgcagc 8 640 < 

agcaaggctg caacgttggc cagcctggca gacacgccag. ccatgaagcg ggtcaacttt 8700 

cagttgccgg cggaggatca caccaagctg aagatgtacg cggtacgcca aggcaagacc 8760 

attaccgagc tgctatctga atacatcgcg cagctaccag agtaaatgag caaatgaata 8820 

25 aatgagtaga tgaattttag cggctaaagg aggcggcatg gaaaatcaag aacaaccagg 8880 

caccgacgcc gtggaatgcc ccatgtgtgg aggaacgggc ggttggccag gcgtaagcgg 8940 

ctgggrttgtc tgccggccct gcaatggcac tggaaccccc aagcccgagg aatcggcgtg 9000 

agcggtcgca aaccatccgg cccggtacaa atcggcgcgg cgctgggtga tgacctggtg 9060- 

gagaagttga aggccgcgca ggccgcccag . cggcaacgca tcgaggcaga agacgccccg 9120 

30 gtgaatcgtg gcaaggggcc gctgatcgaa tccgcaaaga atcccggcaa • ccgccggcag 9180 

ccggtgcgcc gtcgattagg aagccgccca agggcgacga gcaaccagat tttttcgttc 9240 

cgatgctcta tgacgtgggc acccgcgata gtcgcagcat catggacgtg gccgttttcc 9300 

gtctgtcgaa gcgtgaccga cgagctggc^ aggtgatccg ctacgagctt ccagacgggc 9360 

acgtagaggt ttccgcaggg ccggccggca tggccagtgt gtgggattac gacctggtac 9420 

35 tgatggcggt ttcccatcta accgaatcca tgaaccgata ccgggaaggg aagggagaca 94 80 

agcccggccg cgtgttccgt ccacacgttg cggacgtact caagttctgc cggcgagccg 9540 

atggcggaaa gcagaaagac gacctggtag aaacctgcat tcggttaaac accacgcacg 9600 

ttgccatgca gcgtacgaag aaggccaaga acggccgcct ggtgacggta tccgagggtg 9660 

aagccttgat tagccgctac aagatcgtaa agagcgaaac cgggcggccg gagtacatcg 9720 

40 agatcgagct agctgattgg atgtaccgcg agatcacaga aggcaagaac ccggacgtgc 9780 

tgacggttca ccccgattac tttttgatcg atcccggcat cggccgtttt ctctaccgcc 984 0 

tggcacgccg cgccgcaggc aaggcagaag ccagatggtt gttcaagacg atctacgaac 9900 

gcagtggcag cgccggagag ttcaagaagt tctgtttcac cgtgcgcaag ctgatcgggt 9960 

caaatgacct gccggagtac gatttgaagg aggaggcggg gcaggctggc ccgatcctag 10020 

45 tcatgcgcta ccgcaacctg atcgagggcg aagcatccgc cggttcctaa tgtacggagc 10080 



WO 2005/090581 



34/40 



PCT/EP2005/002734 



agatgctagg 
atagcacgta 



10 



15 



20 



25 



30 



35 



aaggcgattt 
cctgtgcata 
ggtcgctgcg 
gaaataccgc 
ctcactgact 
gcggtaatac 
ggccagcaaa 
cgcccccctg 

• 

ggactataaa 
accctgccgc 
catagctcac 
gtgcacgaac 
tccaacccgg 
agagcgaggt 
actagaagga 
gttggtagct 
aagcagcaga 
■ gggtctgacg 
tctccc^att 
tgatagtttg 
ccgcgaagcg 
gtgatctcgc 
cgatcttctt 
gccggcaggc 
actgcgctgt 
tcgggcggcg 
ggaaccggat 
gcttttgtca 
agaatgtcat 
ggaatgatgt 



gcaaattgcc 
cattgggaac 
gtacattggg 
ttccgcctaa 
actgtctggc 
ctccctacgc 

■ 

acagatgcgt 

cgctgcgctc 

ggttatccac 

aggccaggaa 

acgagcatca 

gataccaggc 

ttaccggata 

gctgtaggta 

cccccgttca 

taagacacga 

atgtaggcgg 

cagtatttgg 

cttgatccgg 

ttacgcgcag 

ctcagtggaa 

tgtgtagggc 

gctgtgagca 

gcgtcggctt 

ctttcacgta 

cttgtccaag 

gctccattgc 



ctagcagggg 
ccaaagccgt 
aaccggtcac 



cagcgcacag 
cccgccgctt 



ggtcgttcgg 
agaatcaggg 



caaaaatcga 
gtttccccct 
cctgtccgcc 
tctcagttcg 
gcccgaccgc 
cttatcgcca 
tgctacagag 
tatctgcgct 
caaacaaacc 



cgaaaactca 
ttattatgca 
attatgrtgct 



aaaaaggtcg 
acattgggaa 
acatgtaagt 
aacttattaa 
ccgaagagct 
cgcgtcggcc 
taccgcatca 
ctgcggcgag 
gataacgcag 
gccgcgttgc 
cgctcaagtc 

• 

ggaagctccc 
tttctccctt 
gtgtaggtcg 
tgcgccttat 
ctggcagcag 
ttcttgaagt 
ctgctgaagc 
accgctggta 
tctcaagaag 
cgttaaggga 



aaaaggtctc 
ccggaacccg 
gactgatata 



tttcctgtgg 
tacattggga 



gcaaaaagcg 
tatcgcggcc 
ggcgctcttc 
cggtatcagc 
gaaagaacat 
tggcgttttt 
agaggtggcg 
tcgtgcgctc 
cgggaagcgt 
ttcgctccaa 
ccggtaacta 
ccactggtaa 
ggtggcctaa 
cagttacctt 
gcggtggttt 
atcctttgat 
ttttggtcat 



acccgcctgg 
cctacccttc 
tatgcggtgt 
cgcttcctcg 



gtgagcaaaa 
ccataggctc 
aaacccgaca 
tcctgttccg 
ggcgctttct 
gctgggctgt 
tcgtcttgag 
caggattagc 
ctacggctac 



gtggacaaat 
ataagcctgt 
ccagtcggca 



agttccatag 



agccttacgg 
actgcggagc 
ccaactacct 
aactttgttt 
catcgaccca 



gcaagatagc 
tgcgctgcca 
cgtcgtgcac 
ccgaagtttc 
tcaccgtaac 



cgttaaggtt 
ctccgccgct 
cagatcaatg 



ctgatagttg 
tagggcgact 
cggcgtaacg 



40 



tagtgcatct 
ctagctagac 
tcttccaact 
ctagcttcaa 
gcgacatcct 
agcactacat 
tcatttagcg 
ggacctacca 
tcgatcgtgg 
tgcagttcgc 
aacaatggtg acttctacag 

ttgatcaaag 
atatcactgt 
aacgtcggtt 
tcggcgatca 
gtaacatcgt 
ttggatgccc 
gccactgcgt 



tacggccagc 
agrtcgatact 
gccctgctgc 
cgcrttgctgc 



aacgcttgag 
attatttgcc 
gatctgcgcg 
gtatgaoggg 
tcggcgcgat 
ttcgctcatc 
cctcaaatag 
aggcaacgct 
ctggctcgaa 
gcttagctgg 
cgcggagaat 
ctcgccgcgt 
gtggcttcag 
cgagatggcg 
ccgcttcccc 
tgctgctcca 



ttttgtttgc 
cttttctacg 
gcatgatata 
agacttgacc 
ttaagccgcg 
gactaccttg 
cgaggccaag 
ctgatactgg 
tttgccggtt 
gccagcccag 
atcctgttca 
atgttctctt 



ataacgccac 
ctcgctctct 
tgtttcatca 
gccgccatcc 
ctcgatgacg 
catgatgttt 
taacatcaaa 



tccatg 



10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

117 60 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12466 



45 



<210> 16 
<211> 12539 
<212> DNA 
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<213> daaoNit-PRecombination 



<220> 
<221> 
<222> 
<223> 



misc__feature 
(38) . . (183) 

Agrobacterium right border 



10 



<220> 
<221> 
<222> 
<223> 



terminator 
(196) . . (400) 

complementary: 35S terminator 



15 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(445).. (462) 

cleavage / recognition site for I-Scel endonuclease 



20 



<220> 
<221> 
<222> 
C223> 



(589).. (844) 

complementary: nos terminator 



25 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(921) .. (2027) 

complementary: coding for Rhodotorula gracilis D- 



amino acid oxida 



30 



35 



40 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



promoter 
(2071) .. (3882) 

complementary : A-thaliana nitrilase I promoter 



(3972) . . (4176) 

complementary: 35S terminator 



mi s c_f e a tur e 
(4251) .. (6248) 

complementary: coding for beta-glucuronidase 
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<220> 
<221> 
<222> 
<223> 



promoter 
(6302) . . (7619) 

complementar . coding for sTPT promoter 



10 



15 



20 



25 



30 



35 



40 



<220> 
<221> 
<222> 
<223> 



mi s c_f e a t ur e 
(7691) . . (7907) 
Agrobacterium left border 



<400> 16 



aaaccttttc 
tttacccgcc 
atcagatctg 
aagtatttta 

• 

aaccctataa 
aatagagaga 
tgtctcacga 
tgatggatat 
atcaattcac 
aacatagatg 
atcgcgtatt 
gtcatgcatt 
atcatcgcaa 
cgatcgggga 
atctagagct 
aacgcctcgt 
gcactcgaga 
ctgcccctgc 
gcctcaacgc 
tcgatgcctt 
aggatccgct 
ccgccgcaga 
gacgagtcca 
ggctcggcgg 
accaccaaat 



acgccctttt 



cagcgcgact 
aaatatccga 



gtacccggtc actggatttt 



gaaccctaat 
gata*gatttg 
cgttttgaac 
ctgcagaatt 
tggccgtcgt 
acaccgcgcg 
aaatgtataa 



tcccttatct 



ccagattacc 
cgccctttta 
tttacaacga 



tatcatggac 
ttattctaat 
gatagtttaa 
ggttttagga 
gggtttctta 
gggaactact 
ctggtgattt 
ctgttatccc 
gatcagcaca 
ctcagagctt 
tcctagtttg 



atacaaatgg acgaacggat 
aaacgctctt ttctcttagg 
actgaaggcg ggaaacgaca 
attagaaatt ttattgatag 
tatgctcaac acatgagcga 



gagagggtgt 
cggtaatttg 
aaccgcctcg 



gtccaattcg 
tcgcgcgcga 



gaccggcaac 
tcatccgggt 
tgggtcccgc 
cgacgagctg 
agccatacgc 
cgagcgagag 
ggggtccgcc 
cgatcgttcc 
ggaccgtctc 
tgacttcgcc 
tcgtgcatcg 
cttggtcgtc 
ccgcaccgtc 
gcttctgcag 
cgtaggttac 
gcgtgatgtc 
tccccttgag 
attcttccca 
cgccagccca 
gaatatgcac 



ttgcgggact 

tattacatgc 

aggattcaat 

ctgtggcggg. 

ctacaacttc 

cgcgacatcc 

atgcacaagc 

gggcgacttt 

tcgtcgtgca 

gtcgctcgag 

tgggttgaca 

acctggtcgg 

cttgcatggg 

gatgcccgca 

gaacgcctgc 

ctctcttgca 

gccgatagcg 

cttgtaccag 



cttaagaaac 
aactccacga 
gactcccgcg 
tccgccgcgc 
gtgacctcct 
gtccggtcga 
ggtcgcaagc 
atggtcgggt 
gacaagtccc 
ggaatgatgt 



atcgacttgg 



aggtactggc 



ttttgcttgt 



gctgtagccc 



45 ccgataacgc ctgatccgag gacaacgacg 



tgcccgagca 
tggcccgtcg 
cgaggaccgt 
aaagtctggc 
ttccgagcga 
cgcttctgcg 



ttgcgccggg 
tagtcgagcg 
ctggcggccg 
gacaggaggc 
cgcgctatat 
aaacccatct 
tcaacagaaa 
tttattgcca 
aaatatccga 
ccgcgccgtg 
cccaactctg 
tctccttcgc 
gaggcaggac 
cgacgttgtg 
cgaggcgcaa 
agtctcccac 
aggcgggaga 
ggacggtttg 
cgccaagtcc 
taacggtccg 
agtactttgg 
attcggaaga 
agccgtcttc 



taccocaaac 
gccgccagtg 
ttactagtgg 
ccgatctagt 
tttgttttct 



ttatatgata 



ctgtaagcgt 
tcgagacgtc 
ggatgagggc 
agtgcatggg 



acgcagcaag 
gtaccgctgg 
ctggtatccc 
cgctcgtgcg 
gatccgttct 
gcggaggacc 
gcagtgcttg 
gccgtacgtc 
agcggggtcg 
cccgcggatt 
cgtagcgttg 
tctctcaaac 
tgcgtggacg 
tgggaggggg 
gttctgcgcg 
gacccacttc 
catgaaaggc 
ctccggcaag 
gctgctcaga 
ccctcgacta 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
'1980 
2040 
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gagtcgagat ccgatatcgc ccgggctcga 
tcagagactt gagagaagca atatatagac 
tattttccta ttggagacgg taacgaagat 
gactagccca cgtggtttct tttctctgca 
5 cacatcaaac ctactaataa ttgagaagaa 
cgatgcttta tgtctgaata taaaaaagaa 
ggtttacaaa gtatttggac acaataaatc 
caaataaaaa acaaataaaa aacttaaaag 
tattctttta tggattcctt aaatccatat 

10 ataatatata ctatgttata gatatgtttc 
gtacgattct acctaataat attgaacgga 
ttttttttgg tttgtcttag ctgtatttca 
aaataaaatt tgacgtatag attttttaaa 
tgagtttggc tcctatttaa tcttaggatg 
. 15 tctcagagaa actggattaa ataaactaaa 
atatctttgg atttactatt attggcgaaa 
acaagagaaa tcaaaatttt tgaattaaca 
agttcgcatc aagaaaacca aaccccaatt 
cattgcagct ttcagataaa agaaaaaaac 

20 tacttgcagg aactcaaatt ccactttgcc 
aatgaaacga aatagtcttt ttatgcaaac 
a*ttgtgacag ccatccatat atatagggaa 
acgtaatggt ttagcatagc ttctattttc 
trtaagaaaaa taaattaata tatgtatatt 

25 acttaattat taaacattct aaatgcaaat 
tgaaatcaaa gccagattca taataattgg 
acatccagaa ttgaacactc aaattggatg 
ttggttctct tccattatta acgaaaataa 
.'ctcaacataa cattttgaac ttgactcctt 

30 accgattact tgtttgagtc atcttccgct 
ttcttcgggrt gcfccatttac ggtcaagtgt 
gataacaggg taatctgggt* tcaaaacgtc 
agctcggtac ccggtcactg gattttggtt 
attttacaaa -tacaaataca tactaagggt 

35 ctataagaac cctaattccc ttatctggga 
gagagagata gatttgtaga gagagactgg 
gcaattcccg aggctgtagc cgacgatggt 
cctccctgct gcggtttttc accgaagttc 
gccgccgacrt tcggtttgcg gtcgcgagtg 

■ 

40 aatatgcctt gcgaggtcgc aaaatcggcg 
ctgacgcgat caaagacgcg gtgatacata 
cacatgtcgg tgtacattga gtgcagcccg 
ataatcggct gatgcagttt ctcctgccag 
gccgtttcca aatcgccgct ttggacatac 

45 tcaaagagat cgctgatggt atcggtgtga 



gtctttgttt 


tttactttgg ttcatgacac 


2100 


ttttttttgt 


tttttttttg tggtcacgtt 


2160 


cgaacctgtg 


gtggaaatga aacaaggtgg 


2220 


ttgatttgtt 


tttgtttttt ttgtaaagtt 


2280 


aaataaaatc tattgattga ttaaaccagc 


2340 


crtcraaaaccc 

mm mjm mm* ^^fc mm* mm* mm* 


cgtttaagaa 


ttacaacggt 


2400 


caaaccraaat 

mm* \m\ \m* mm* Wfl *k U U 


aaaacaaaat 


ggagaactac 


2460 


aatttattec 

^4 W mm W Bb mm 


attttttttc 


ccgtagaatt 


2520 


ttoatocatt 


ttgattcctc ataataggta 


2580 


taattccrtat 


taacctacct 


ttttttggtc 


2640 


attoratcrttt 


tggaccactt 


agaaagtatt 


2700 


ttaaatataa 


atttaaataa 


gaaatgtcat 


27 60 


tccattttat 


gttatttaat 


atttgaaatg 


2B20 


acrttaatact 


aagttttcct taatgaatta 


2880 

m^ mm mm 


aaatagatca 


atgtgttttg gtccggtcaa 


2940 


aaaaacrtctc 


atatagtaaa 


tcatattcct 


3000 


taaattartat 


agtttcttat 


ataaccaatt 


3060 


aataatcaaa 


cgggcttggt 


aggaatattt 


3120 


acacactcaa 


gtcttttatt 


tcatctttct 


3180 


acttttcttt 

*A mm *^ mm mm 


acaaafaaac 


acaaattgtc 


3240 


actcrtttcrbc 

w mmw *^ mm mm* 


ttttttcgat 


cacgtttctg 


3300 


t gtaaaacaa ' 


caacatgtga 


agtcacatat 


3360 


gttgtcaata 


ttagtcattc 


caaaacattt 


3420 


cttcrcraacta 


atgtatgtgg 


aaatacagta 


3480 


atcrcaaaaaa 

mm* mm mmj mm**mm±*^*-****- m^ mm*^^* 


aaaaaagaaa 


agaacacaac 


3540 


ctacatggtt 


gtagaatgta 


gggtaacaca 


3600 


atacratcrcfat 

mm* mm mm* mm mm* mm -mmj mmj mm* mm 


aatctttaga 


tacaagagaa 


3660 


agaaaaaaag 


tttagcataa 


aagtttgaaa 


3720 


catacrcxaartci 


acatgaactg 


acgaatcaca 


3780 


ttctccacct 


tcgaaatgaa 


tgtgaccggt 


3840 


aaaacatctg 


gtctcgaggt 


acctggtagg 


3900 


gtgagacagt 


ttggtgcagg tcgaaattcg 


3960 


ttaggaatta 


gaaattttat 


tgatagaagt 


4020 


ttcttatatg 


ctcaacacat 


gagcgaaacc 


4080 


actactcaca 


cattattctg 


gagaaaaata 


4140 


tgatttttgc 


gccgggtacc 


gagctcggrta 


4200 


gcgccaggag 


agttgttgat 


tcattgtttg 


4260 


atgccagtcc 


agcgtttttg 


cagcagaaaa 


4320 


aagatccctt 


tcttgttacc 


gccaacgcgc 


4380 


aaattccata 


cctgttcacc 

• 


gacgacggcg 


4440 


tccagccatg 


cacactgata 


ctcttcactc 


4500 


gctaacgtat 


ccacgccgta 


ttcggtgatg 


4560 


gccagaagtt 


ctttttccag 


taccttctcrt 


4620 


catccgtaat aacggttcag 


gcacagcaca 


4'680 

■ 


gcgtcgcaga 


acattacatt 


gacgcaggtg 


4740 
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atcggacgcg tcgggtcgag tttacgcgtt 
acttgcggac gggtatccgg ttcgttggca 
ttgtcacgcg ctatcagctc tttaatcgcc 
actgcctctt cgctgtacag ttctttcggc 
5 gagaggttaa agccgacagc agcagtttca 
cagtcgagca tctcttcagc gtaagggtaa 
cagtccatta atgcgtggtc gtgcaccatc 
tccgcatctt catgacgacc aaagccagta 
tgttcgccct tcactgccac tgaccggatg 

10 gtctggcttt tggctgtgac gcacagttca 
tgcggattca ccacttgcaa agtcccgcta 
tccgcatcac gcagttcaac gctgacatca 
gcgtggttac agtcttgcgc gacatgcgtc 
ggcgtggtgt agagcattac. gctgcgatgg 
. 15 taagactgct ttttcttgcc gttttcgtcg 
cagttcagtt cgttgttcac acaaacggtg 
tattagaaaa gttataaatt aaaatataca 
tatactacat tcttttattt tgaaaaaaat 
gataattatt atatatatat. caaaggtaga 

20 ataacatacg gcgtgacatc ggcttcaaat 
tcctgattat tgacccacac tttgccgtaa 
cgctggcctg. cccaaccttt . cggtataaag 
taattacgaa tatctgcatc. ggcgaactga 
cggctttctt gtaacgcgct ttcccaccaa 

25 agactgaatg cccacaggcc gtcgagtttt 
cgrtaccatgg tcgatcgact ctagactagt 
atgaaatcga aattcagagt tttgatagtg 
tcgagtattt caagagatgg tacttgttga 
aatccagtat ctcgcggtgt atggagttat 

30 gtagagaatg atgtaattag ataagaatct 
gggtccatct tatcttgata agtggatggt 
gaagttccat atcaccatca tcatctgtat 
.tgaaattttg gtagatttta tttttcatta 
tattgaacgt gacagattca caagaaagta 
• 35 taaaggtatc tctatttaat ttcttttgtt 
tttcatgctc atagtcatat acgttattct 
cgtaatacaa ttttataatt ttatcaaagt 
tacaaaacaa aaaaatcttc taattttttt 
aaaaatacaa atgaacccac atgtattatg 

40 gtactcaatt caccctatac tctttttttt 
ttctgtccat ttgaccaaat tcaattaatt 
atgttcacta tttctcgatt tgagggaacc 
atcactacaa atatgttata gtttatactt 
attaagtaaa catgattctt cttatgaagt 

45 gtgattgatg ttattggttg attgagtgat 



gcttccgcca 


grtggcgaaat 


attcccgtgc 


4 O UU 


atactccaca tcaccacgct 


tgggtggttt 


48 oQ 


tgtaagtgcg 


ettgetgagt 


ttccccgttg 


4±?ZU 


ttgttgcccg cttcgaaacc aatgcctaaa 


4 y o u 


tcaat caeca egatgecatg ttcatctgcc 


OU4 U 


tgegaggtae 


ggtaggagtt 


ggccccaatc 


510U 


ageaegttat 


cgaatccttt 


gecaegtaag 


5160 


aagtagaacg gtttgtggtt aatcaggaac 


c o o n 

5220 


ccgacgcgaa 


gegggtagat 


atcacactct 


C" O O f*\ 

5280 


tagagataac cttcacccgg ttgecagagg 


5340 


gtgccttgtc cagttgeaac 


cacctgttga 


5400 


ccattggcca 


ccacctgcca 


gtcaacagac 


54 60 


accacggtga tatcgtccac ccaggtgttc 


5520 


attceggcat 


agttaaagaa 


atcatggaag 


5580 


gtaatcacca ttcccggcgg gatagtctgc 


5d40 


atacctgcac 


atcaacaaat 


tttggtcata 


c ~t r\ n 

5700 


cacttataaa 


ctacagaaaa 


geaattgeta 


57 60 


atttgaaata 


ttatattact 


actaattaat 


5820 


agcagaaact 


.tacgtacact 


tttcccggca 


5880 


ggegtatage 


cgccctgatg 


ctccatcact 


5940 


tgagtgaccg 


catcgaaacg 


cagcacgata 


J-V f\ f\ 

6000 


acttcgcgct 


gataccagac 


gttgcccgca 


6060 


tegttaaaac 


tgcctggcac 


ageaattgee 


mm P\ 

6120 


.cgctgaccaa 


ttccacagtt" 


ttcgcgatcc 


6180 


ttgatttcac 


gggttggggt 


ttctacagga 


6240 
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gttcttatga ggtaatttgt tactcattca tccttttgca tatgagaaat tgtgttagcg 7500 

tacgcaaaac aatagagaac ataaaagata tgtgtattta tttaaggtga cttttgttaa 7560 

tgatattgta gtatctatac atttatatat aacttgttga atttgagtat aagctatcag 7620 

gatccggggg atcctctaga gtcgaggtac cgagctcgaa ttcactggcc gtcgttttac 7680 

5 aacgactcag tactgcttgg taataattgt cattagattg tttttatgca tagatgcact 7740 

cgaaatcagc caattttaga caagtatcaa acggatgtta attcagtaca ttaaagacgt 7800 

ccgcaatgtg ttattaagtt gtctaagcgt caatttgttt acaccacaat atatcctgcc 7860 

accagccagc caacagctcc ccgaccggca gctcggcaca aaatcaccac gcgtctaaaa 7920 

aggtgatgtg tatttgagta aaacagcttg cgtcatgcgg tcgctgcgta tatgatgcga 7980 

10 tgagtaaata aacaaatacg caaggggaac gcatgaaggt tatcgctgta cttaaccaga 
aaggcgggtc aggcaagacg accatcgcaa cccatctagc ccgcgccctg caactcgccg 

gggccgatgt tctgttagtc gattccgatc cccagggcag tgcccgcgat tgggcggccg 8160 

tgcgggaaga tcaaccgcta accgttgtcg gcatcgaccg cccgacgatt gaccgcgacg 8220 

tgaaggccat cggccggcgc gacttcgtag tgatcgacgg agcgccccag gcggcggact B2B0 

15 . tggctgtgtc cgegatcaag gcagccgact. tcgtgctgat tccggtgcag ccaagccctt 8340 

acgacatatg ggccaccgcc gacctggtgg agctggttaa gcagcgcatt gaggtcacgg B400 

atggaaggct acaagcggcc tttgtcgtgt cgcgggcgat caaaggcacg cgcatcggcg 84 60 

gtgaggttgc . cgaggcgctg gccgggtacg. agctgcccat tcttgagtcc cgtatcacgc 8520 
agcgcgtgag ctacccaggc actgccgccg ccggcacaac cgttcttgaa tcagaacccg • 8580 

20 agggcgacgc tgcccgcgag gtccaggcgc tggccgptga aattaaatca aaactcattt 8640 

gagttaatga ggtaaagaga aaatgagcaa aagcacaaac. acgctaagtg ccggccgtcc 8700 
gagcgcacgc agcagcaagg. ctgcaacgtt ggccagcctg gcagacacgc cagccatgaa • 8760 

gcgggtcaac tttcagttgc cggcggagga tcacaccaag ctgaagatgt .acgcggtacg 8820 

ccaaggcaag accattaccg- agctgctatc tgaatacatc gcgcagctac cagagtaaat 8880 

25 gagcaaatga ataaatgagt agatgaattt. tagcggctaa aggaggcggc atggaaaatc ■ 894 0 

aagaacaacc aggcaccgac gccgtggaat gccccatgtg tggaggaacg ggcggttggc 9000 

caggcgtaag cggctgggtt gtctgccggc cctgcaatgg cactggaacc cccaagcccg 9060 

aggaatcggc gtgagcggtc gcaaaccatc * cggcccggta caaatcggcg cggcgctggg 9120 

tgatgacctg gtggagaagt tgaaggccgc gcaggccgcc cagcggcaac. gcatcgaggc 9180 

30 agaagacgcc ccggtgaatc gtggcaaggg gccgctgatc gaatccgcaa agaatcccgg 9240 

caaccgccgg cagccggtgc gccgtcgatt aggaagccgc ccaagggcga cgagcaacca 9300 

. gattttttcg ttccgatgct ctatgacgtg ggcacccgcg atagtcgcag catcatggac 9360 

gtggccgttt tccgtctgfcc gaagcgtgac cgacgagctg gcgaggtgat . ccgctacgag 9420 

cttccagacg ggcacgtaga ggtttccgca gggccggccg- gcatggccag tgtgtgggat 9480 

35 tacgacctgg tactgatggc ggtttcccat ctaaccgaat ccatgaaccg ataccgggaa 954 0 

gggaagggag acaagcccgg ccgcgtgttc cgtccacacg ttgcggacgt actcaagttc 9600 

tgccggcgag ccgatggcgg aaagcagaaa gacgacctgg tagaaacctg cattcggtta 9660 

aacaccacgc acgttgccat gcagcgtacg aagaaggcca agaacggccg cctggtgacg 9720 

gtatccgagg gtgaagcctt gattagccgc tacaagatcg taaagagcga aaccgggcgg 9780 

40' ccggagtaca tcgagatcga gctagctgat tggatgtacc gcgagatcac agaaggcaag 9840 

aacccggacg tgctgacggt tcaccccgat tactttttga tcgatcccgg catcggccgt 9900 

tttctctacc gcctggcacg ccgcgccgca ggcaaggcag aagccagatg gttgttcaag 9960 

acgatctacg aacgcagtgg cagcgccgga gagttcaaga agttctgttt caccgtgcgc 10020 

aagctgatcg ggtcaaatga cctgccggag tacgatttga aggaggaggc ' ggggcaggcrt 10080 

45 ggcccgatcc tagtcatgcg ctaccgcaac ctgatcgagg gcgaagcatc cgccggttcc 10140 
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taatgtacgg agcagatgct agggcaaatt gccctagcag gggaaaaagg tcgaaaaggfc 10200 

ctctttcctg tggatagcac gtacattggg aacccaaagc cgtacattgg gaaccggaac 10260 

ccgrtacattg ggaacccaaa gccgtacatt gggaaccggt cacacatgrta agtgactgat 10320 

ataaaagaga aaaaaggcga tttttccgcc taaaactctt taaaacttat taaaactctt 10380 

5 • aaaacccgcc tggcctgtgc ataactgtct ggccagcgca cagccgaaga gctgcaaaaa 10440 

gcgcctaccc ttcggtcgct gcgctcccta cgccccgccg cttcgcgtcg gcctatcgcg 10500 

gcctatgcgg tgtgaaatac cgcacagatg cgtaaggaga aaataccgca tcaggcgctc 10560 

ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc 10620 

agctcactca aaggcggtaa tacggrttatc cacagaatca ggggataacg caggaaagaa 10680 

10 catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt 10740 

tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg 10800 

gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg 10860 

ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag 10920 

cgtggcgctt tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc 10980 

15 caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa 11040 

ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg 11100 

taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc 11160 

taactacggc tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac 11220 

cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg 1128 0 

20 • tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt 113.40 

gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt ' 11400 

catgcatgat atatctccca. atttgtgtag .-ggcttattat . gcacgcttaa. aaataataaa 114 60 

agcagacttg acctgatagt ttggctgtga gcaattatgt gcttagtgca tctaacgctt 11520 

gagttaagcc gcgccgcgaa gcggcgtcgg cttgaacgaa tttctagcta gacattattt 11580 

25 gccgactacc ttggtgatct cgcctttcac gtagtggaca aattcttcca actgatctgc 11640 

gcgcgaggcc aagcgatctt cttcttgtcc aagataagcc tgtctagctt caagtatgac 11700 

gggctgatac tgggccggca ggcgctccat tgcccagtcg gcagcgacat ccttcggcgc 11760 

gattttgccg gttactgcgc tgrtaccaaat gcgggacaac gtaagcacta catttcgctc 11820 

atcgccagcc cagtcgggcg gcgagttcca tagcgttaag gtttcattta gcgcctcaaa 11880 

30 ta'gatcctgt tcaggaaccg gatcaaagag ttcctccgcc gctgga-ccta ccaaggcaac 1194 0 

gctatgttct cttgcttttg tcagcaagat agccagatca atgtcgatcg tggctggctc 12000 

gaagatacct gcaagaatgt-. cattgcgctg .ccattctcca aattgcagtt cgcgcttagc .12060 

tggataacgc cacggaatga tgtcgtcgtg cacaacaatg-- gtgacttcta cagcgcggag 12120 

aatctcgctc tctccagggg aagccgaagt ttccaaaagg , tcgttgatca aagctcgccg 12180 

35 cgttgtttca tcaagcctta cggtcaccgt aaccagcaaa tcaatatcac tgtgtggctt 12240 

caggccgcca tccactgcgg agccgtacaa atgtacggcc agcaacgtcg gttcgagatg 12300 

gcgctcgatg acgccaacta cctctgatag ttgagtcgat acttcggcga tcaccgcttc 12360 

ccccatgatg tttaactttg ttttagggcg actgccctgc tgcgtaacat cgttgctgct 12420 

ccataacatc aaacatcgac ccacggcgta acgcgcttgc tgcttggatg cccgaggcat 12480 

40 agactgtacc ccaaaaaaac agtcataaca agccatgaaa accgccactg cgttccatg 12539 



